NOVEL BAG PROTEINS AND
NUCLEIC ACID MOLECULES ENCODING THEM 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE
UNDER FEDERALLY-SPONSORED RESEARCH AND DEVELOPMENT

This invention was made with government support
under grant number CA-67329 awarded by the National
Institutes of Health. The United States Government has
certain rights in this invention.

BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION

This invention relates generally to the fields of 
molecular biology and molecular medicine and more 
specifically to a novel family of proteins that can 
regulate protein folding. The functions of these proteins
are potentially diverse, including promoting tumor cell
growth and metastasis.

BACKGROUND INFORMATION 

The Hsc70/Hsp70-family of molecular chaperones
participates in protein folding reactions, controlling
protein bioactivity, degradation, complex
assembly/disassembly, and translocation across membranes.
These proteins interact with hydrophobic regions within
target proteins via a carboxyl (C)-terminal peptide binding
domain, with substrate binding and release being controlled
by the N-terminal ATP-binding domain of Hsc70/Hsp70.
Hsc70/Hsp70-assisted folding reactions are accomplished by
repeated cycles of peptide binding, refolding, and release,
which are coupled to ATP hydrolysis by the ATP-binding
domain (ATPase) of Hsc70/Hsp70 and by subsequent nucleotide
exchange. The chaperone activity of mammalian Hsc70/Hsp70
is regulated by partner proteins that either modulate the
peptide binding cycle or that target the actions of these
chaperones to specific proteins and subcellular
compartments. DnaJ-family proteins (Hdj-1/Hsp40; Hdj-2;
Hdj-3) stimulate the ATPase activity of Hsc70/Hsp70,
resulting in the ADP-bound state, which binds tightly to
peptide substrates. The Hip protein collaborates with
Hsc70/Hsp70 and DnaJ homologues in stimulating ATP
hydrolysis, and thus also stabilizes Hsc70/Hsp70 complexes
with substrate polypeptides, whereas the Hop protein may
provide co-chaperone functions through interactions with
the C-terminal peptide binding domain.



The Bcl-2 associated athanogene-1 (bag-1) is
named from the Greek word athanos, which refers to
anti-cell death. BAG-1 was previously referred to as
Bcl-2-associated protein-1 (BAP-1) in U.S. Patent No.
5,539,094 issued July 23, 1996, which is incorporated
herein by reference. In this earlier patent, BAG-1 is
described as a portion of the human BAG-1 protein, absent
the N-terminal amino acids 1 to 85. In addition, a human
protein essentially identical to human BAG-1 was described
by Zeiner and Gehring (Proc. Natl. Acad. Sci. USA
92:11465-11469 (1995)). Subsequent to the issuance of U.S.
Patent 5,539,094, the N-terminal amino acid sequence from 1
to 85 of human BAG-1 was reported.



BAG-1 and its longer isoforms BAG-1M (Rap46) and
BAG-1L are recently described Hsc70/Hsp70-regulating
proteins. BAG-1 competes with Hip for binding to the
Hsc70/Hsp70 ATPase domain and promotes substrate release.
BAG-1 also reportedly stimulates Hsc70-mediated ATP
hydrolysis by accelerating ADP/ATP exchange, analogous to
the prokaryotic GrpE nucleotide exchange protein of the
bacterial Hsc70 homologue, DnaK. Gene transfection studies
indicate that BAG-1 proteins can influence a wide variety
of cellular phenotypes through their interactions with
Hsc70/Hsp70, including increasing resistance to apoptosis,
promoting cell proliferation, enhancing tumor cell
migration and metastasis, and altering transcriptional
activity of steroid hormone receptors.



Despite the notable progress in the art, there
remains an unmet need for the further identification and
isolation of additional homologous BAG protein species, and
the nucleic acid molecules and/or nucleotide sequences
that encode them. Such species would provide additional
means by which the identity and composition of the BAG
domain, that is, the portion of the protein that is
influencing or modulating protein folding, could be
identified. In addition, such species would be useful for
identifying agents that modulate apoptosis as candidates
for therapeutic agents, in particular, anticancer agents.
The present invention satisfies these needs, as well as
providing substantial related advantages.

SUMMARY OF THE INVENTION

The present invention provides a family of BAG-1
related proteins from humans [BAG-1L (SEQ ID NO:2), BAG-1
(beginning at residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID
NO:4), BAG-3 (SEQ ID NO:6) and (SEQ ID NO:20), BAG-4 (SEQ
ID NO:8) and (SEQ ID NO:22) and BAG-5 (SEQ ID NO:10) and
(SEQ ID NO:24)], the invertebrate C. elegans [BAG-1 (SEQ ID
NO:12), BAG-2 (SEQ ID NO:14)] and the fission yeast S. pombe
[BAG-1A (SEQ ID NO:16), BAG-1B (SEQ ID NO:18)] and the
nucleic acid molecules that encode them.

Another aspect of the present invention provides
an amino acid sequence present in the family of BAG-1
related proteins that modulates Hsc70/Hsp70 chaperone
activity, that is, the BAG domain.

Another aspect of the present invention provides
novel polypeptide and nucleic acid compositions and methods
useful in modulating Hsc70/Hsp70 chaperone activity.

Another aspect of the present invention is
directed to methods for detecting agents that modulate the
binding of the BAG family of proteins, such as BAG-1
(beginning at residue 116 of SEQ ID NO:2), and related
proteins with the Hsc70/Hsp70 family of proteins or with
other proteins that may interact with the BAG-family
proteins.

Still another aspect of the present invention is
directed to methods for detecting agents that induce the
dissociation of a bound complex formed by the association
of BAG-family proteins with Hsc70/Hsp70 family molecular
chaperones or other proteins.

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows the full length cDNA sequence for
human BAG-1 (SEQ ID NO:1) protein with the corresponding
amino acid sequence (SEQ ID NO:2). Within the full length
sequence are included the overlapping sub-sequences of
BAG-1 (beginning at nucleotide 391), BAG-1M [beginning at
nucleotide 260 of (SEQ ID NO:2)], and BAG-1L [beginning at
nucleotide 46 of (SEQ ID NO:2)].

Figures 2A and 2B combined show the full length
cDNA sequence (SEQ ID NO:3) aligned with the corresponding
amino acid residues for human BAG-2 protein (SEQ ID NO:4).

Figure 3 shows a cDNA sequence (SEQ ID NO:5)
aligned with the corresponding amino acid residues for
human BAG-3 protein (SEQ ID NO:6).

Figure 4 shows a cDNA sequence (SEQ ID NO:7)
aligned with the corresponding amino acid residues for
human BAG-4 protein (SEQ ID NO:8).

Figure 5 shows a cDNA sequence (SEQ ID NO:9)
aligned with the corresponding amino acid residues for
human BAG-5 protein (SEQ ID NO:10).

Figure 6A shows the full length cDNA sequence for
C. elegans BAG-1 protein (SEQ ID NO:11).

Figure 6B shows the 210 amino acid sequence for
C. elegans BAG-1 protein (SEQ ID NO:12).

Figure 7A shows the full length cDNA sequence for
C. elegans BAG-2 protein (SEQ ID NO:13).

Figure 7B shows the 458 amino acid sequence for
C. elegans BAG-2 protein (SEQ ID NO:14).

Figure 8A shows the full length cDNA sequence for
S. pombe BAG-1A protein (SEQ ID NO:15).

Figure 8B shows the 195 amino acid sequence for
S. pombe BAG-1A protein (SEQ ID NO:16).

Figure 9A shows the full length cDNA sequence for
S. pombe BAG-1B protein (SEQ ID NO:17).

Figure 9B shows the 206 amino acid sequence for
S. pombe BAG-1B protein (SEQ ID NO:18).

Figure 10 shows the topologies of the BAG-family
proteins; human BAG proteins, BAG-1 (SEQ ID NO:2), BAG-2
(SEQ ID NO:4), BAG-3 (SEQ ID NO:6), BAG-4 (SEQ ID NO:8),
BAG-5 (SEQ ID NO:10); S. pombe BAG-1A (SEQ ID NO:16) and
BAG-1B (SEQ ID NO:18); and C. elegans BAG-1 (SEQ ID
NO:12) and BAG-2 (SEQ ID NO:14). (A) The relative
positions of the BAG domains are shown in black,
ubiquitin-like regions are represented in gray, and WW
domains are represented by stripes. Nucleoplasmin-like
nuclear localization sequences are also shown. (B) The
amino acid sequences of the BAG domain for human BAG-1
(SEQ ID NO:2), BAG-2 (SEQ ID NO:4), BAG-3 (SEQ ID NO:6),
BAG-4 (SEQ ID NO:8), BAG-5 (SEQ ID NO:10), S. pombe BAG-1A
(SEQ ID NO:16) and BAG-1B (SEQ ID NO:18), and C. elegans
BAG-1 (SEQ ID NO:12) and BAG-2 (SEQ ID NO:14) are aligned,
demonstrating their homology. Black and gray shading
represent identical and similar amino acids, respectively.



Figure 11 shows assays demonstrating the
interaction of BAG-family proteins with Hsc70/ATPase. (A)
Two-hybrid assays using yeast expressing the indicated
fusion proteins. Blue color indicates a positive
interaction, resulting in activation of the lacZ reporter
gene. (B) In vitro protein assays using GST-fusion
proteins and 35S-labeled in vitro translated proteins. (C)
Co-immunoprecipitation assays using anti-Flag or IgG1
control antibodies and lysates from 293T cells expressing
Flag-tagged BAG-1 (beginning at residue 116 of SEQ ID
NO:2), BAG-2 (SEQ ID NO:4), BAG-3 (SEQ ID NO:6), Daxx, or
Apaf-1.

Figure 12 shows surface plasmon resonance
analysis of BAG-family protein interactions with
Hsc70/ATPase. (A) SDS-PAGE analysis of purified
recombinant proteins. (B) Representative SPR results of
biosensor chips containing immobilized BAG proteins with
and without maximally bound Hsc70/ATPase.

Figure 13 shows representative SPR results for
biosensor chips containing immobilized BAG-1 (beginning at
residue 116 of SEQ ID NO:2), BAG-1 (ΔC), BAG-2 (SEQ ID NO:4),
or BAG-3 (SEQ ID NO:6) proteins. Hsc70/ATPase was flowed
over the chips (arrow/left) until maximal binding was
reached (response units), then flow was continued without
Hsc70/ATPase (arrow/right). For BAG-2 (SEQ ID NO:4) and
BAG-3 (SEQ ID NO:6), Hsc70 was injected at 0.0175, 0.035,
0.07, 0.14, and 0.28 µM.

Figure 14 shows BAG-family protein modulation of
Hsc70 chaperone activity. (A) Protein refolding assay of
chemically-denatured luciferase by Hsc70 plus DnaJ in the
absence or presence of BAG and BAG-mutant proteins. (B)
Concentration-dependent inhibition of Hsc70-mediated
protein refolding by BAG-family proteins [BAG-1 (beginning
at residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID NO:4), BAG-3
(SEQ ID NO:6)] but not by BAG-mutant [BAG-1 (ΔC)]. (C)
Hsc70/Hsp40-mediated refolding of heat-denatured luciferase
was assayed in the presence (black bars) or absence
(striped bars) of 1.8 µM Hip, with (lanes 3-10) or without
(lanes 1, 2) various BAG-family proteins (1.8 µM) as
indicated (mean ± SE; n=3). A control (CNTL) is shown (lane
1) in which Hsc70 was replaced with an equivalent amount of
BSA.

Figure 15A shows an expanded cDNA sequence for
human BAG-3 protein (SEQ ID NO:19).

Figure 15B shows the corresponding amino acid
residues for the human BAG-3 protein (SEQ ID NO:20) of
Figure 15A.

Figure 15C shows the expanded cDNA sequence (SEQ
ID NO:19) aligned with the corresponding amino acid
residues for human BAG-3 protein of Figure 15A (SEQ ID
NO:20).

Figure 16A shows an expanded cDNA sequence for
human BAG-4 protein (SEQ ID NO:21).

Figure 16B shows the corresponding amino acid
residues for the human BAG-4 protein of Figure 16A (SEQ ID
NO:22).

Figure 16C shows the expanded cDNA sequence (SEQ
ID NO:21) aligned with the corresponding amino acid
residues for human BAG-4 protein of Figure 16A (SEQ ID
NO:22).

Figure 17A shows an expanded cDNA sequence for
human BAG-5 protein (SEQ ID NO:23).

Figure 17B shows the corresponding amino acid
residues for the human BAG-5 protein of Figure 17A (SEQ ID
NO:24).

Figure 17C shows the expanded cDNA sequence (SEQ
ID NO:23) aligned with the corresponding amino acid
residues for human BAG-5 protein of Figure 17A (SEQ ID
NO:24).

Figure 18 shows the topologies of the BAG-family
proteins; human BAG proteins, BAG-1 (SEQ ID NO:2), BAG-2
(SEQ ID NO:4), expanded BAG-3 (SEQ ID NO:20), expanded
BAG-4 (SEQ ID NO:22), expanded BAG-5 (SEQ ID NO:24);
S. pombe BAG-1A (SEQ ID NO:16) and BAG-1B (SEQ ID NO:18); and
C. elegans BAG-1 (SEQ ID NO:12) and BAG-2 (SEQ ID NO:14).
The relative positions of the BAG domains are shown in
black, ubiquitin-like regions are represented in gray, and
WW domains are represented by stripes. Nucleoplasmin-like
nuclear localization sequences are also shown.

Definitions 

The term "apoptosis", as used herein, refers to
the process of programmed cell death. Although not all
programmed cell deaths occur through apoptosis, as used
herein, "apoptosis" and "programmed cell death" are used
interchangeably.

The term "tumor cell proliferation", as used
herein, refers to the ability of tumor cells to grow and
thus expand a tumor mass.

The term "cell migration", as used herein, refers
to the role cell motility plays in the invasion and
potentially metastasis by tumor cells.

The term "metastasis", as used herein, refers to
the spread of a disease process from one part of the body
to another, as in the appearance of neoplasms in parts of
the body remote from the site of the primary tumor; it
results in dissemination of tumor cells by the lymphatics
or blood vessels or by direct extension through serous
cavities or subarachnoid or other spaces.

The term "steroid hormone receptor function", as
used herein, refers to physiological, cellular and molecular
functioning of receptor sites that bind with steroid
hormones.

The term "substantially purified", as used
herein, refers to nucleic acid or amino acid sequences that
are removed from their natural environment, isolated or
separated, and are at least 60% free, preferably 75% free,
and most preferably 90% free from other components with
which they are naturally associated.

"Nucleic acid molecule" as used herein refers to
an oligonucleotide, nucleotide, or polynucleotide, and
fragments or portions thereof, and to DNA or RNA of genomic
or synthetic origin which may be single or double stranded,
and represent the sense or antisense strand.

"Hybridization", as used herein, refers to any
process by which a strand of nucleic acid binds with a
complementary strand through base pairing.

The terms "complementary" or "complementarity",
as used herein, refer to the natural binding of
polynucleotides under permissive salt and temperature
conditions by base-pairing. For example, the sequence
"A-G-T" binds to the complementary sequence "T-C-A".
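By way of illustration only, the base-pairing rule in this
definition can be sketched in a few lines of Python. The
function names and the handling of the hyphen-separated
notation are assumptions made for this example and are not
part of the disclosure. The sketch pairs bases position by
position, which reproduces the "A-G-T" / "T-C-A" example, and
also shows the reverse complement commonly used when both
strands are written 5' to 3'.

```python
# Watson-Crick pairing for DNA bases (illustrative only).
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(seq: str) -> str:
    """Pair each base position by position, e.g. 'A-G-T' -> 'T-C-A'."""
    bases = seq.upper().split("-")
    return "-".join(COMPLEMENT[b] for b in bases)


def reverse_complement(seq: str) -> str:
    """Complement and reverse, so the result also reads 5' to 3'."""
    bases = seq.upper().split("-")
    return "-".join(COMPLEMENT[b] for b in reversed(bases))


print(complement("A-G-T"))          # T-C-A
print(reverse_complement("A-G-T"))  # A-C-T
```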

The term "homology", as used herein, refers to a
degree of complementarity. There may be partial homology
or complete homology (i.e., identity). A partially
complementary sequence is one that at least partially
inhibits an identical sequence from hybridizing to a target
nucleic acid and is referred to using the functional term
"substantially homologous." The inhibition of
hybridization of the completely complementary sequence to
the target sequence may be examined using a hybridization
assay (Southern or northern blot, solution hybridization
and the like) under conditions of low stringency. A
substantially homologous sequence or probe will compete for
and inhibit the binding (i.e., the hybridization) of a
completely homologous sequence or probe to the target
sequence under conditions of low stringency.



The term "antisense", as used herein, refers to
nucleotide sequences which are complementary to a specific
DNA or RNA sequence. The term "antisense strand" is used in
reference to a nucleic acid strand that is complementary to
the "sense" strand. Antisense molecules may be produced by
any method, including synthesis by ligating the gene(s) of
interest in a reverse orientation to a viral promoter which
permits the synthesis of a complementary strand. Once
introduced into a cell, this transcribed strand combines
with natural sequences produced by the cell to form
duplexes. These duplexes then block either the further
transcription or translation. In this manner, mutant
phenotypes may be generated. The designation "negative" is
sometimes used in reference to the antisense, and
"positive" is sometimes used in reference to the sense
strand.

"Amino acid sequence" as used herein refers to an
oligopeptide, peptide, polypeptide, or protein sequence,
and fragments or portions thereof, and to naturally
occurring or synthetic molecules. Where "amino acid
sequence" is recited herein this term excludes an amino
acid sequence of a naturally occurring protein. "Amino
acid sequence", "polypeptide" or "protein" are not meant to
limit the amino acid sequence to the complete, native amino
acid sequence associated with the recited protein molecule.

The term "functional fragments" or "fragments",
as used herein, with regard to a protein refers to portions
of that protein that are capable of exhibiting or carrying
out the activity exhibited by the protein as a whole. The
portions may range in size from three amino acid residues
to the entire amino acid sequence minus one amino acid.
For example, a protein "comprising at least a functional
fragment of the amino acid sequence of SEQ ID NO:1",
encompasses the full-length of the protein of SEQ ID NO:1
and portions thereof.
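As an illustration of the size range stated in this
definition, the following Python sketch enumerates every
contiguous portion of a sequence from three residues up to
the full length minus one residue. It is an assumption-laden
example: the function name and the short test peptide are
hypothetical, and the sketch only lists candidate portions.
Whether a given portion is "functional" would still have to
be determined by an activity assay.

```python
from typing import Iterator


def candidate_fragments(protein: str, min_len: int = 3) -> Iterator[str]:
    """Yield contiguous portions from min_len residues up to len(protein) - 1."""
    n = len(protein)
    for length in range(min_len, n):          # stops at n - 1 residues
        for start in range(n - length + 1):   # every start position that fits
            yield protein[start:start + length]


# Hypothetical five-residue peptide, used only to show the enumeration.
print(list(candidate_fragments("MKVLA")))
# ['MKV', 'KVL', 'VLA', 'MKVL', 'KVLA']
```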



A "derivative" of a BAG protein, as used herein,
refers to an amino acid sequence that is altered by one or
more amino acids. The derivative may have "conservative"
changes, wherein a substituted amino acid has similar
structural or chemical properties, e.g., substitution of an
apolar amino acid with another apolar amino acid (such as
replacement of leucine with isoleucine). The derivative
may also have "nonconservative" changes, wherein a
substituted amino acid has different but sufficiently
similar structural or chemical properties that permit such
a substitution without adversely affecting the desired
biological activity, e.g., replacement of an amino acid
with an uncharged polar R group with an amino acid with an
apolar R group (such as replacement of glycine with
tryptophan), or alternatively replacement of an amino acid
with a charged R group with an amino acid with an uncharged
polar R group (such as replacement of lysine with
asparagine).
Amino Acids - Apolar R Groups

                                            Abbreviations
Amino Acid       Radical                    3-Letter   1-Letter
alanine          methyl                     ala        A
valine           2-propyl                   val        V
leucine          2-methylpropyl             leu        L
isoleucine       2-butyl                    ile        I
proline          propyl* - cyclized         pro        P
phenylalanine    benzyl                     phe        F
tryptophan       3-indolylmethyl            trp        W
methionine       methylthioethyl            met        M

Amino Acids - Uncharged Polar R Groups

                                            Abbreviations
Amino Acid       Radical                    3-Letter   1-Letter
glycine          H                          gly        G
serine           hydroxymethyl              ser        S
threonine        1-hydroxyethyl             thr        T
cysteine         thiolmethyl                cys        C
tyrosine         4-hydroxyphenylmethyl      tyr        Y
asparagine       aminocarbonylmethyl        asn        N
glutamine        aminocarbonylethyl         gln        Q

Amino Acids - Charged R Groups

                                            Abbreviations
Amino Acid       Radical                    3-Letter   1-Letter
aspartic acid    carboxymethyl              asp        D
glutamic acid    carboxyethyl               glu        E
lysine           4-aminobutyl               lys        K
arginine         3-guanylpropyl             arg        R
histidine        4-imidazoylmethyl          his        H

Similar minor modifications may also include amino acid
deletions or insertions or both. Guidance in determining
which amino acid residues may be modified as indicated
above without abolishing the desired biological
functionality may be obtained using computer programs
well known in the art, for example, DNASTAR software. In
addition, the derivative may also result from chemical
modifications to the encoded polypeptide, including but not
limited to the following: replacement of hydrogen by an
alkyl, acyl, or amino group; esterification of a carboxyl
group with a suitable alkyl or aryl moiety; alkylation of
a hydroxyl group to form an ether derivative. Further, a
derivative may also result from the substitution of an
L-configuration amino acid with its corresponding
D-configuration counterpart.
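The R-group classes tabulated above lend themselves to a
simple lookup. The following Python sketch is offered only as
an illustration under stated assumptions: the dictionary is
transcribed from the three tables, while the function name
and the wording of its return values are hypothetical. It
reports whether a substitution stays within one R-group class
or crosses classes; it does not predict whether biological
activity is retained, which the text leaves to tools such as
DNASTAR and to experiment.

```python
# One-letter codes grouped by the R-group classes in the tables above.
R_GROUP_CLASS = {
    **dict.fromkeys("AVLIPFWM", "apolar"),
    **dict.fromkeys("GSTCYNQ", "uncharged polar"),
    **dict.fromkeys("DEKRH", "charged"),
}


def compare_r_groups(original: str, replacement: str) -> str:
    """Describe a substitution in terms of the tabulated R-group classes."""
    before = R_GROUP_CLASS[original.upper()]
    after = R_GROUP_CLASS[replacement.upper()]
    if before == after:
        return f"same class ({before})"
    return f"class change ({before} -> {after})"


# The three substitutions named in the definition of "derivative".
print(compare_r_groups("L", "I"))  # leucine -> isoleucine
print(compare_r_groups("G", "W"))  # glycine -> tryptophan
print(compare_r_groups("K", "N"))  # lysine -> asparagine
```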

The term "mimetic", as used herein, refers to a
molecule, the structure of which is developed from
knowledge of the structure of a protein/polypeptide or
portions thereof (such as BAG-1) and, as such, is able to
effect some or all of the actions of BAG-1 protein.

"Peptide nucleic acid", as used herein, refers to
a molecule which comprises an oligomer to which an amino
acid residue, such as lysine, and an amino group have been
added. These small molecules, also designated anti-gene
agents, stop transcript elongation by binding to their
complementary strand of nucleic acid (Nielsen, P.E. et al.,
Anticancer Drug Des. 8:53-63 (1993)).

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a family of BAG-1
related proteins from humans [BAG-1L (SEQ ID NO:2), BAG-1S
beginning at residue 116 of SEQ ID NO:2, BAG-2 (SEQ ID
NO:4), BAG-3 (SEQ ID NO:6) and (SEQ ID NO:20), BAG-4 (SEQ
ID NO:8) and (SEQ ID NO:22) and BAG-5 (SEQ ID NO:10) and
(SEQ ID NO:24)], the invertebrate C. elegans [BAG-1 (SEQ ID
NO:12), BAG-2 (SEQ ID NO:14)] and the fission yeast S. pombe
[BAG-1A (SEQ ID NO:16), BAG-1B (SEQ ID NO:18)],
specifically the full length amino acid sequences
comprising human BAG-1L (SEQ ID NO:2), BAG-1 (beginning at
residue 116 of SEQ ID NO:2), and BAG-2 (SEQ ID NO:4); C.
elegans BAG-1 (SEQ ID NO:12) and BAG-2 (SEQ ID NO:14); and
S. pombe BAG-1A (SEQ ID NO:16) and BAG-1B (SEQ ID NO:18);
and partial sequences comprising human BAG-3 (SEQ ID NO:6)
and (SEQ ID NO:20), BAG-4 (SEQ ID NO:8) and (SEQ ID NO:22),
and BAG-5 (SEQ ID NO:10) and (SEQ ID NO:24) and functional
fragments thereof. In particular, the invention provides
the amino acid sequences comprising human BAG-2 (SEQ ID
NO:4), BAG-3 (SEQ ID NO:6) and (SEQ ID NO:20), BAG-4 (SEQ
ID NO:8) and (SEQ ID NO:22), and BAG-5 (SEQ ID NO:10) and
(SEQ ID NO:24) proteins.



Another aspect of the present invention provides
the nucleic acid molecules and nucleotide sequences that
encode the family of BAG-1 related proteins from humans
[BAG-1 (SEQ ID NO:1), BAG-2 (SEQ ID NO:3), BAG-3 (SEQ ID
NO:5) and (SEQ ID NO:19), BAG-4 (SEQ ID NO:7) and (SEQ ID
NO:21) and BAG-5 (SEQ ID NO:9) and (SEQ ID NO:23)], the
invertebrate C. elegans [BAG-1 (SEQ ID NO:11), BAG-2 (SEQ ID
NO:13)] and the fission yeast S. pombe [BAG-1A (SEQ ID
NO:15), BAG-1B (SEQ ID NO:17)].



BAG-1L (SEQ ID NO:2) is a multifunctional protein
that blocks apoptosis, promotes tumor cell metastasis, and
contributes to factor-independent and p53-resistant cell
growth. BAG-1L (SEQ ID NO:2) interacts with several types
of proteins, including Bcl-2, some tyrosine kinase growth
factor receptors, steroid hormone receptors, and the
p53-induced cell cycle regulator Siah-1A.



BAG-1 is a regulator of Hsc70/Hsp70 family
molecular chaperones. A carboxyl-terminal domain in this
protein binds tightly to the ATPase domains of Hsc70 and
Hsp70 (K_D = 1 nM) (Zeiner, M., Gebauer, M., and Gehring, U.,
EMBO J. 16: 5483-5490, (1997)). BAG-1 modulates the
activity of these molecular chaperones, acting as an
apparent functional antagonist of the
Hsp70/Hsc70-associated protein Hip (3-5) (Hohfeld, J. and
Jentsch, S., EMBO J. 16: 6209-6216, (1997); Takayama, S.,
Bimston, D. N., Matsuzawa, S., Freeman, B. C., Aime-Sempe,
C., Xie, Z., Morimoto, R. I., and Reed, J. C., EMBO J. 16:
4887-96, (1997); Zeiner, M., Gebauer, M., and Gehring, U.,
EMBO J. 16: 5483-5490, (1997)). In general, protein
refolding is accomplished by Hsp70/Hsc70 through repeated
cycles of target peptide binding and release, coupled to
ATP hydrolysis (Ellis, R., Curr. Biol. 7: R531-R533,
(1997)). BAG-1 appears to promote substrate release,
whereas Hip stabilizes Hsp70/Hsc70 complex formation with
target peptides (Hohfeld, J., Minami, Y., and Hartl, F.-U.,
Cell 83: 589-598, (1995)). Since each substrate interaction
with Hsc70/Hsp70 is unique in terms of the optimal length
of time the protein target should remain complexed with
Hsc70/Hsp70 for achieving new conformations, the net effect
of BAG-1 can be either enhancement or inhibition of the
refolding reaction.



The 70 kD heat shock family proteins (Hsp70/Hsc70)
are essential to a variety of cellular processes and have
been implicated in cancer, yet it is unclear how these
proteins are regulated in vivo. A variety of co-chaperones
have been identified which may target Hsp70/Hsc70 to
different subcellular compartments or promote their
interactions with specific proteins or protein complexes.
BAG-1 appears to represent a novel Hsp70/Hsc70 regulator
which differs functionally from all other mammalian
co-chaperones identified to date, such as members of the
DnaJ-, Hip-, Hop-, and cyclophilin-families of proteins.

Another aspect of the present invention provides
the amino acid sequence of a binding domain of about 40 to
55 amino acids that binds the Hsc70/Hsp70 ATPase domain.
The BAG domain is situated near the C-terminus, and the
ubiquitin-like domains are situated near the N-terminus.

The BAG family of proteins of the present
invention contains a common conserved C-terminal domain (the
"BAG" domain) that facilitates binding to the ATPase domain
of Hsp70/Hsc70. The carboxyl-terminal domain of BAG-1
binds to the ATPase domain of Hsc70/Hsp70 and regulates its
chaperone function by acting as an ADP-ATP exchange factor.
Other domains of BAG-1 mediate interactions with proteins
such as Bcl-2 and retinoic acid receptors (RARs), allowing
BAG-1 to target Hsc70/Hsp70 to other proteins, presumably
modulating their function by changing their conformations.

Human BAG-1 was previously shown to inhibit
Hsc70/Hsp70 dependent refolding of denatured protein
substrates in vitro (S. Takayama, et al., EMBO J. 16,
4887-96 (1997); M. Zeiner, M. Gebauer, U. Gehring, EMBO J.
16, 5483-5490 (1997); and J. Hohfeld, S. Jentsch, EMBO J. 16,
6209-6216 (1997)). In Example III, Part A, the effects of
recombinant human BAG-1, BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ
ID NO:6) were compared using in vitro protein refolding
assays similar to those employed previously for assessing
BAG-1. The study showed that addition of equimolar amounts
of each of the recombinant proteins to Hsc70 resulted in
significant inhibition of luciferase refolding, with BAG-2
(SEQ ID NO:4) and BAG-3 (SEQ ID NO:6) showing somewhat
greater inhibitory activity than BAG-1 (Figure 14A). In a
separate luciferase folding study, BAG-1, BAG-2 (SEQ ID
NO:4) and BAG-3 (SEQ ID NO:6) once again displayed
inhibition of luciferase refolding; however, in this study
varying amounts of BAG-1, BAG-2 (SEQ ID NO:4) and BAG-3
(SEQ ID NO:6) were added relative to Hsc70, which resulted
in concentration-dependent inhibition of Hsc70 chaperone
activity, i.e., luciferase folding (Example III, Part A).
Additional follow-on studies using the same experimental
protocols as the previous studies, as taught in Example
IIA, have shown that BAG-4 (SEQ ID NO:22) also undergoes
association with Hsc70/ATPase.

Yet another aspect of the present invention
provides a nucleotide sequence having at least about 15
nucleotides and, generally, about 25 nucleotides,
preferably about 35 nucleotides, more preferably about 45
nucleotides, and most preferably about 55 nucleotides that
can hybridize or is complementary under relatively
stringent conditions to a portion of the nucleic acid
sequences shown in Figures 1-9 and Figures 15-17, in
particular the BAG domain as shown in Figure 1B, e.g.,
nucleotides 552-593 of human BAG-3, or nucleotides 167-221
of human BAG-4.

Yet another aspect of the present invention
provides a compound of the formula,

wherein,

R^N is a group of 1 to 552 independently selected
amino acids;

R^1 is a group of 3 independently selected amino
acids;

X^1 is an amino acid with a charged or uncharged
R group, such as aspartic acid, glutamic acid, asparagine,
or glutamine;

R^2 is a group of 7 independently selected amino
acids;

X^2 is an amino acid with a charged R group, such
as glutamic acid;

R^3 is a group of 5 independently selected amino
acids;

X^3 is an amino acid with an apolar R group, such
as leucine, methionine, or isoleucine;

R^4 is a group of 3 independently selected amino
acids;

X^4 is an amino acid with a charged R group, such as
aspartic acid or glutamic acid;

R^5 is a single independently selected amino acid;

X^5 is an amino acid with an apolar or uncharged R
group, such as leucine, valine, methionine, alanine or
threonine;

R^6 is a group of 15 independently selected amino
acids;

X^6 is an amino acid with a charged or uncharged
R group, such as arginine, lysine, glutamine or aspartic
acid;

R^7 is a group of 2 independently selected amino
acids;

X^7 is an amino acid with a charged R group, such
as arginine;

X^8 is an amino acid with a charged R group, such
as arginine or lysine;

R^9 is a group of 2 independently selected amino
acids;

X^9 is an amino acid with an apolar R group, such
as valine;

R^10 is a group of 3 independently selected amino
acids;

X^10 is an amino acid with an uncharged R group,
such as glutamine;

R^11 is a group of 2 independently selected amino
acids;

X^11 is an amino acid with an apolar R group, such
as leucine; and

R^C is a group of 1 to 100 independently selected
amino acids.

A nucleotide sequence of at least about 15
nucleotides and, generally, about 25 nucleotides,
preferably about 35 nucleotides, more preferably about 45
nucleotides, and most preferably about 55 nucleotides can
be useful, for example, as a primer for the polymerase
chain reaction (PCR) or other similar reaction mediated by
a polymerase such as a DNA or RNA polymerase (see PCR
Protocols: A guide to methods and applications, ed. Innis
et al. (Academic Press, Inc., 1990), which is incorporated
herein by reference; see, for example, pages 40-41). In
addition, such a nucleotide sequence of the invention can
be useful as a probe in a hybridization reaction such as
Southern or northern blot analysis or in a binding assay
such as a gel shift assay.



A nucleotide sequence of the invention can be
particularly useful as an antisense molecule, which can be
DNA or RNA and can be targeted to all or a portion of the
5'-untranslated region or of the 5'-translated region of a
bag-1 nucleic acid sequence in a cell. For example, an
antisense molecule can be directed to at least a portion of
the sequence shown as the BAG domain in Figure 1A, e.g.,
nucleotides 272-319 of human BAG-1L (SEQ ID NO:1), or
nucleotides 79-147 of human BAG-5 (SEQ ID NO:9). Since the
5'-region of a nucleic acid contains elements involved in
the control of expression of an encoded protein, an
antisense molecule directed to the 5'-region of a nucleic
acid molecule can affect the levels of protein expressed in
a cell.
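
An antisense molecule is, in sequence terms, the reverse complement of the targeted stretch. The following minimal sketch illustrates that relationship only; the function names are hypothetical and no sequence from the Figures is reproduced here.

```python
# Watson-Crick complements for DNA, upper and lower case.
_DNA_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def antisense_dna(target: str) -> str:
    """Return the DNA reverse complement of a target written 5'->3'."""
    unexpected = set(target.upper()) - set("ACGT")
    if unexpected:
        raise ValueError(f"non-DNA characters: {sorted(unexpected)}")
    return target.translate(_DNA_COMPLEMENT)[::-1]


def antisense_rna(target: str) -> str:
    """Return the RNA reverse complement (U in place of T)."""
    return antisense_dna(target).replace("T", "U").replace("t", "u")
```

For a made-up six-base target such as `"ATGGCC"`, `antisense_dna` returns `"GGCCAT"`; in practice the input would be the chosen sub-range of the target nucleic acid.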



A nucleotide sequence of the invention also can
be useful as a probe to identify a genetic defect due to a
mutation of a gene encoding a BAG protein in a cell. Such
a genetic defect can lead to aberrant expression of a BAG
protein in the cell or to expression of an aberrant BAG
protein, which does not properly associate with a Bcl-2-
related protein or Hsc70/Hsp70 protein in the cell. As a
result, a genetic defect in a gene encoding, for example,
human BAG-1 can result in a pathology characterized by
increased or decreased levels of protein folding.

Further, a nucleotide compound or composition as
taught in the present invention can be synthesized using
routine methods or can be purchased from a commercial
source. In addition, a population of such nucleotide
sequences can be prepared by restriction endonuclease or
mild DNAse digestion of a nucleic acid molecule that
contains nucleotides as shown in the nucleotide sequences
shown in Figures 1-9 and Figures 15-17 that encode the
amino acid sequences also shown in Figures 1-9 and
Figures 15-17. Methods for preparing and using such
nucleotide sequences, for example, as hybridization probes
to screen a library for homologous nucleic acid molecules
are well known in the art (see, for example, Sambrook et
al., Molecular Cloning: A laboratory manual (Cold Spring
Harbor Laboratory Press 1989); Ausubel et al., Current
Protocols in Molecular Biology (Green Publ., NY 1989),
each of which is incorporated herein by reference).

A particular nucleotide sequence can be designed
based, for example, on a comparison of the nucleic acid
molecules encoding any one of the BAG family proteins, as
shown in Figures 1-9 and Figures 15-17, with another in the
family. Such a comparison allows, for example, the
preparation of a nucleotide sequence that will hybridize to
a conserved region present in both nucleic acid molecules,
thus providing a means to identify homologous nucleic acid
molecules present in other cell types or other organisms.
In addition, such a comparison allows the preparation of a
nucleotide sequence that will hybridize to a unique region
of any of the BAG family nucleotide sequences, such as
those corresponding to the BAG domain, thus allowing
identification of other proteins sharing this motif. In
this regard, it is recognized that, while the human BAG-3
proteins shown as Figures 3 and 20, and human BAG-5
proteins shown as Figures 5 and 24, are only partial
sequences, a variant human BAG-3 or BAG-5 produced, for
example, by alternative splicing can exist and can be
identified using an appropriately designed nucleotide
sequence of the invention as a probe. Such useful probes
readily can be identified by inspection of the sequences
shown in the disclosed Figures by a comparison of the
encoding nucleotide sequences.

WO 00/14106 PCT/US99/21053

If desired, a nucleotide sequence of the
invention can incorporate a detectable moiety such as a
radiolabel, a fluorochrome, a ferromagnetic substance, a
luminescent tag or a detectable binding agent such as
biotin. These and other detectable moieties and methods of
incorporating such moieties into a nucleotide sequence are
well known in the art and are commercially available. A
population of labelled nucleotide sequences can be
prepared, for example, by nick translation of a nucleic
acid molecule of the invention (Sambrook et al., supra,
1989; Ausubel et al., supra, 1989).

One skilled in the art would know that a method
involving hybridization of a nucleotide sequence of the
invention can require that hybridization be performed under
relatively stringent conditions such that nonspecific
background hybridization is minimized. Such hybridization
conditions can be determined empirically or can be
estimated based, for example, on the relative GC content of
a sequence and the number of mismatches, if known, between
the probe and the target sequence (see, for example,
Sambrook et al., supra, 1989).
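
As an illustration of such an estimate, the sketch below applies one commonly quoted empirical approximation for the melting temperature of a DNA duplex, Tm = 81.5 + 16.6 log10[Na+] + 0.41(%GC) - 675/N, with a rule-of-thumb reduction of about 1°C per 1% mismatch. This is an assumption for illustration only: the constants differ somewhat between published versions, the formula is intended for longer probes, and it is not stated in the present text. The function names and the default salt concentration are hypothetical.

```python
import math


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    s = seq.upper()
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def estimate_tm_celsius(probe: str, na_molar: float = 0.3,
                        mismatch_percent: float = 0.0) -> float:
    """Rough duplex Tm from salt, GC content, length and mismatches.

    Empirical approximation; treat the result as a starting point for
    choosing hybridization and wash temperatures, not as a measurement.
    """
    n = len(probe)
    tm = (81.5
          + 16.6 * math.log10(na_molar)   # monovalent cation term
          + 0.41 * gc_percent(probe)      # GC-content term
          - 675.0 / n)                    # length correction
    return tm - 1.0 * mismatch_percent    # about 1 degree C per 1% mismatch
```

Stringent conditions are then typically chosen somewhat below the estimated value and refined empirically, as the text notes.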



The invention further provides antibodies
specific for human BAG family protein. As used herein, the
term "antibody" includes polyclonal and monoclonal
antibodies, as well as polypeptide fragments of antibodies
that retain a specific binding activity for human BAG-1 of
at least about 1 x 10^5 M^-1. One skilled in the art would
know that anti-BAG-1 antibody fragments such as Fab, F(ab')2
and Fv fragments can retain specific binding activity for
human BAG-1 (beginning at residue 116 of SEQ ID NO:2) and,
thus, are included within the definition of an antibody.
In addition, the term "antibody" as used herein includes
naturally occurring antibodies as well as non-naturally
occurring antibodies and fragments that retain binding
activity such as chimeric antibodies or humanized
antibodies. Such non-naturally occurring antibodies can be
constructed using solid phase peptide synthesis, can be
produced recombinantly or can be obtained, for example, by
screening combinatorial libraries consisting of variable
heavy chains and variable light chains as described by Huse
et al., Science 246:1275-1281 (1989), which is incorporated
herein by reference.



One skilled in the art would know that purified
BAG family protein, which can be prepared from natural
sources or synthesized chemically or produced
recombinantly, or portions of a BAG family protein,
including a portion of human BAG family protein such as a
synthetic peptide as described above, can be used as an
immunogen. Such peptides useful for raising an antibody
include, for example, peptide portions of the N-terminal 85
amino acids or the BAG domain of any of the human BAG
proteins (see Figure 1B). A particularly advantageous use
of such a protein is for immunostaining, wherein the
method provides a process to contrast the immunostaining
of BAG-family proteins in carcinoma cells with adjacent
non-neoplastic prostatic epithelial and basal cells which
are generally present in the same tissue sections. These
results would be correlated with a Gleason grade to
determine whether any of the BAG-family proteins tend to be
expressed at higher or lower levels in histologically
advanced tumors. From this process a determination can be
made as to the degree to which the disease is progressing
in a given patient, i.e., a prognosis can be made.

Non-immunogenic fragments or synthetic peptides
of BAG proteins can be made immunogenic by coupling the
hapten to a carrier molecule such as bovine serum albumin
(BSA) or keyhole limpet hemocyanin (KLH), as described in
Example IV, below. In addition, various other carrier
molecules and methods for coupling a hapten to a carrier
molecule are well known in the art and described, for
example, by Harlow and Lane, Antibodies: A laboratory
manual (Cold Spring Harbor Laboratory Press, 1988), which
is incorporated herein by reference.

EXAMPLES 



The following examples are given to enable those
skilled in the art to more clearly understand and to
practice the present invention. They should not be
considered as limiting the scope of the invention, but
merely as being illustrative and representative thereof.

EXAMPLE I

Isolation and Characterization
of BAG-family cDNA Sequences

This example describes methods for isolating and
characterizing BAG-family cDNA sequences from human,
nematode and yeast.

A. Cloning of human BAG cDNA sequences 

Yeast two-hybrid library screening of a human
Jurkat cell cDNA library was performed as described
(Takayama et al., EMBO J., 16:4887-96 (1997); Matsuzawa et
al., EMBO J., 17:2736-2747 (1998), which are incorporated
herein by reference) using EGY48 strain yeast transformed
with pGilda-Hsc70/ATPase (67-377 amino acids) and the lacZ
reporter plasmid pSH18-34. Of the resulting ~5 x 10^6
transformants, 112 Leu+ colonies were obtained after
1 week incubation at 30°C. Assay of β-galactosidase (β-gal)
activity of these colonies resulted in 96 clones. Mating
tests were then performed using RFY206 yeast strain
transformed with pGilda, pGilda mBAG-1 (1-219), or pGilda
Hsc70/ATPase. Of these, 66 displayed specific interactions
with Hsc70/ATPase. The pJG4-5 cDNAs were recovered using
KC8 E. coli strain which is auxotrophic for tryptophan
(Trp). DNA sequencing revealed 3 partially overlapping
human BAG-1, 4 identical and one overlapping cDNAs encoding
BAG-2, and 2 partially overlapping BAG-3 clones.

Using the above described yeast two-hybrid screen
with the ATPase domain of Hsc70 as "bait", several human
cDNAs were cloned which encode portions of BAG-1 or of two
other BAG-1-like proteins which are termed BAG-2 (SEQ ID
NO:4) and BAG-3 (SEQ ID NO:6). The longest of the cDNAs
for BAG-2 (SEQ ID NO:3) and BAG-3 (SEQ ID NO:5) contained
open reading frames (ORFs) of 207 and 162 amino acids,
respectively, followed by stop codons. All BAG-1 (SEQ ID
NO:1), BAG-2 (SEQ ID NO:3) and BAG-3 (SEQ ID NO:5) cDNAs
obtained by two-hybrid library screening with Hsc70/ATPase
contained a conserved domain of about 40-50 amino acids,
which is termed the "BAG" domain and is shown in Figure
10. These results demonstrate that a family of BAG-1-
related proteins all contain a conserved ~45 amino acid
region near their C-terminus that binds Hsc70/Hsp70.



B. Identification of additional BAG-family proteins

A search of the translated Genbank database using
the bBLAST and FASTA search programs also identified human
ESTs that provided sequences for further investigation of
BAG-family proteins. The putative BAG-4 (SEQ ID NO:8) and
BAG-5 (SEQ ID NO:10) proteins contain BAG-domains that
share the greatest sequence similarity with the BAG-domain
of BAG-3 (SEQ ID NO:6). These were designated BAG-4
(Accession number AA693697, N74588) and BAG-5 (Accession
number AA456862, N34101). BAG-4 has 62% identity and ~81%
similarity to BAG-3, and BAG-5 has 51% identity and ~75%
similarity to BAG-3.
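
Percent identity and percent similarity figures of this kind are computed over aligned positions. The sketch below shows one simple way to do so for two pre-aligned sequences, assuming gaps are written as "-" and that "similar" means membership in the same residue group. The grouping shown is an illustrative assumption; the text does not state which scoring scheme produced the quoted values, so this sketch is not expected to reproduce them.

```python
# Illustrative conservative-substitution groups (an assumption).
SIMILAR_GROUPS = ["ILVM", "FYW", "KRH", "DE", "NQ", "ST", "AG"]


def _same_group(a: str, b: str) -> bool:
    return any(a in group and b in group for group in SIMILAR_GROUPS)


def identity_and_similarity(aligned_a: str, aligned_b: str):
    """Return (% identity, % similarity) over ungapped aligned columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = similar = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue                      # skip gapped columns
        compared += 1
        if a == b:
            identical += 1
            similar += 1                  # identical residues also count as similar
        elif _same_group(a, b):
            similar += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared
```

For the made-up pair `"LKDE-A"` and `"LRDQGA"`, five columns are compared, three are identical and one more is similar, giving 60% identity and 80% similarity.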

Additional BAG-family orthologues or homologues
were also identified using computer-based searches, which
revealed BAG-family homologues in the nematode C. elegans
and the fission yeast S. pombe. The C. elegans genome
encodes two apparent BAG-family proteins, which are most
similar in their overall sequences to the human BAG-1
(AF039713, gi:2773211) (SEQ ID NO:12) and BAG-2 (SEQ ID
NO:14) (AF068719, gi:3168927). S. pombe contains two
BAG-family proteins that share the greatest overall
sequence similarity with human BAG-1 (AL023554, gi/3133105
and AL023634, gi/3150250). The human and C. elegans BAG-1
proteins as well as S. pombe BAG-1A all have ubiquitin-like
domains near their N-termini (see Figure 10A) of unknown
function.

The overall predicted amino acid sequences of the
C. elegans BAG-1 (SEQ ID NO:12) and S. pombe BAG-1A (SEQ ID
NO:16) proteins are ~18% identical (~61% similar) and ~17%
identical (~64% similar), respectively, to human BAG-1,
implying origin from a common ancestral gene. The C.
elegans BAG-1 protein (SEQ ID NO:12), however, contains a
5 to 7 amino acid insert in its BAG-domain as compared to
the human, murine, and yeast BAG-1 homologues (see Figure
10B), and is more similar to BAG-2 (SEQ ID NO:4) in regard
to its BAG-domain. C. elegans and human BAG-2 also may be
derived from a common ancestor, as the C-terminal 225 amino
acid regions of C. elegans and human BAG-2, which encompass
both the BAG domain and the upstream region, share
~34% amino acid sequence identity and ~70% similarity. The
human BAG-2 protein (SEQ ID NO:4), however, contains a 9
amino acid insert in its BAG-domain compared to its
C. elegans counterpart (see Figure 10B). Evolutionary-tree
prediction algorithms suggest that human and C. elegans
BAG-2 represent a distinct branch of the BAG-family that is
more evolutionarily distant from the other BAG-family
proteins. None of the predicted BAG-family proteins
contain recognizable regions analogous to those found in
other Hsc70 regulatory proteins, such as the J-domains and
G/F-domains of DnaJ family proteins and the
Tetratricopeptide Repeat (TR) domains of Hip/Hop family
proteins.

C. Yeast two-hybrid assay of BAG binding to Hsc70/ATPase

The longest of the cDNAs obtained for the BAG-2
and BAG-3 proteins were expressed with N-terminal
transactivation (TA) domains in yeast and tested by yeast
two-hybrid assay for interactions with fusion proteins
consisting of Hsp70/ATPase or a variety of unrelated
proteins (Fas, Siah, Fadd) containing N-terminal LexA DNA-
binding domains. TA-BAG-2 and TA-BAG-3 demonstrated
positive interactions with LexA-Hsc70/ATPase, resulting in
transactivation of a lacZ reporter gene that was under the
control of LexA operators (Figure 11A). No interactions
with LexA-Fas (cytosolic domain), LexA-Siah, LexA-Fadd, or
LexA were detected (see Figure 11A), demonstrating that the
BAG-2 and BAG-3 proteins interact specifically with
Hsc70/ATPase. Specific two-hybrid interactions between
Hsc70/ATPase and either BAG-2 or BAG-3 were also observed
when BAG-2 and BAG-3 were expressed as LexA DNA-binding
domain fusion proteins and Hsc70/ATPase was fused with a TA
domain (see Figure 11A; right panel). These results
demonstrate that, similarly to BAG-1, BAG-2 and BAG-3
specifically interact with Hsc70/ATPase.

In order to determine whether the BAG proteins
are capable of forming heterodimers, coexpression of BAG-2
and BAG-3 in the yeast two-hybrid assay was also performed.
Coexpression of BAG-2 and BAG-3 failed to show interaction
with BAG-1 or a deletion mutant of BAG-1 (ΔC) which is
missing part of its C-terminal domain required for
Hsp70/Hsc70 binding, suggesting that these proteins do not
form heterodimers.

D. Isolation and characterization of the complete open
reading frame sequences of BAG-2 and BAG-3

In order to deduce the complete ORFs of BAG-2 and
BAG-3, a λ-phage cDNA library was screened as follows,
using hybridization probes derived from the two-hybrid
screening. A human Jurkat T-cell λ-ZapII cDNA
library (Stratagene) was screened by hybridization using
32P-labeled purified insert DNA from the longest of the
human BAG-2 (clone #11) and human BAG-3 (clone #28) cDNA
clones. From about one million clones screened, 38 BAG-2
and 23 BAG-3 clones were identified, cloned, and their cDNA
inserts recovered as pSKII plasmids using a helper phage
method (Stratagene). DNA sequencing of λ-phage derived
human BAG-2 cDNA clones revealed an ORF encoding a
predicted 211 amino acid protein, preceded by an in-frame
stop codon. The longest human BAG-3 λ-phage cDNA clone
contains a continuous ORF of 682 amino acids followed by a
stop codon, but without an identifiable start codon (see
Figure 10A).

Although BAG-1L (SEQ ID NO:2), BAG-1 (beginning
at residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID NO:4), and
BAG-3 (SEQ ID NO:6) all contain a homologous BAG domain
near their C-terminus, the N-terminal regions of these
proteins are dissimilar. Using a combination of search
tools (Prosite Search: PP search, using the Prosite pattern
database, BCM Search Launcher, Baylor College of Medicine,
and Blocks Search), it was determined that the BAG-2
N-terminal region contains potential kinase phosphorylation
sites but otherwise shares no apparent similarity with
other proteins or known functional domains.



In contrast, the predicted N-terminal region of
BAG-3 contains a WW domain as shown in Figure 10A. WW
domains have been identified in a wide variety of signaling
proteins, including a Yes kinase adaptor protein (YAP), the
Na+-channel regulator Nedd4, formin-binding proteins,
dystrophin, and the peptidyl prolyl cis-trans-isomerase
Pin-1. These roughly 40 amino acid domains mediate protein
interactions and bind the preferred peptide ligand sequence
xPPxY (Sudol, TIBS, 21:161-163, 1996, which is
incorporated herein by reference).

EXAMPLE II

In vitro Association of
BAG proteins and Hsc70/ATPase

This example demonstrates that BAG-2 (SEQ ID
NO:4) and BAG-3 (SEQ ID NO:6) bind Hsc70/ATPase in various
in vitro assays.



A. Solution binding assay of BAG-2 and BAG-3 to
Hsc70/ATPase

Association of BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ
ID NO:6) with Hsc70/ATPase was determined by an in vitro
protein binding assay where Hsc70/ATPase or BAG-family
proteins were expressed in bacteria as Glutathione S-
Transferase (GST) fusion proteins. Purified cDNA sequences
encoding residues 5 to 211 of human BAG-2 (clone #11) and
the C-terminal 135 amino acids of human BAG-3 (clone #28)
(see Figure 10A) were subcloned into the EcoRI/XhoI sites
of pGEX4T-1 prokaryotic expression plasmid (Pharmacia;
Piscataway, NJ). These plasmids as well as pGEX4T-1-BAG-1,
pGEX-4T-1-BAG-1 (ΔC), and pGEX-4T-1-XL which have been
described previously (Takayama et al., supra (1997); Xie et
al., Biochemistry, 37:6410-6418, (1998), which are
incorporated herein by reference), were expressed in XL-1
blue strain E. coli (Stratagene, Inc., La Jolla, CA).
Briefly, a single colony was inoculated into 1L of LB media
containing 50 µg/ml ampicillin and grown at 37°C overnight.
The culture was then diluted by half with fresh
LB/ampicillin and cooled to room temperature for 1 hr,
before inducing with 0.4 mM IPTG for 6 h at 25°C.

Cells were recovered and incubated with 0.5 mg/ml
lysozyme in 50 mM Tris (pH 8.0), 150 mM NaCl, 1% Tween-20,
0.1% 2-mercaptoethanol, 5 mM EDTA, 1 mM PMSF and a mixture
of other protease inhibitors obtained from Boehringer
Mannheim (1697498) at room temperature for 0.5 h, followed
by sonication. Cellular debris were pelleted by
centrifugation at 27,500g for 10 min and the resulting
supernatants were incubated with 30 ml of glutathione-
Sepharose (Pharmacia) at 4°C overnight. The resin was then
washed with 20 mM Tris (pH 8.0), 150 mM NaCl, 0.1% Tween-
20, and 0.1% 2-mercaptoethanol until the OD 280nm reached
<0.01. For removal of GST, the resin with immobilized GST-
fusion protein was incubated with 10U of thrombin
(Boehringer, Inc.) at 4°C in 20 mM Tris (pH 8.0), 150 mM
NaCl, 0.1% Tween-20, 0.1% 2-mercaptoethanol, and 2.5 mM
CaCl2 overnight. Released proteins were then purified on
Mono Q (HR10/10, Pharmacia) by FPLC using a linear gradient
of 0.5M NaCl at pH 8.0 and dialyzed into chaperone assay
buffer.

The ability of BAG-2 (SEQ ID NO:4) or BAG-3 (SEQ
ID NO:6) to bind Hsc70/ATPase in solution was then
examined. GST control or GST-BAG proteins were immobilized
on glutathione-Sepharose and tested for binding to 35S-
labeled in vitro translated (IVT) proteins.
Immunoprecipitation and in vitro GST-protein binding assays
were performed as described by Takayama et al., supra
(1997), using pCI-Neo flag or pcDNA3-HA into which human
BAG-2 (clone #11) or human BAG-3 (clone #28) had been
subcloned for in vitro translation of 35S-L-methionine
labeled proteins or expression in 293T cells. As shown in
Figure 11B, 35S-Hsc70/ATPase bound in vitro to GST-BAG-1,
GST-BAG-2, and GST-BAG-3 but not to GST-BAG-1 (ΔC) or
several other control proteins. BAG-1 (beginning at
residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID NO:4), and BAG-3
(SEQ ID NO:6) also exhibited little or no binding to
themselves or to each other, demonstrating that these
proteins do not strongly homo- or hetero-dimerize or
oligomerize. It should be noted, however, that BAG-2 (SEQ
ID NO:4) displayed weak interactions with itself in binding
assays and produced a positive result in yeast two-hybrid
experiments, demonstrating that it can have the ability to
self-associate.

B. Binding of BAG proteins to Hsc70 in vivo

The ability of BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ
ID NO:6) proteins to interact in cells with Hsc70 was
tested by expressing these proteins with N-terminal Flag
epitope tags in 293T human epithelial cells using co-
immunoprecipitation assays as described previously
(Takayama et al., supra (1997)). cDNAs encoding the λ-
phage cloned regions of BAG-2 and BAG-3 were subcloned in-
frame into pcDNA3-Flag. Anti-Flag immune complexes
prepared from 293T cells after transfection with plasmids
encoding Flag-BAG-1, Flag-BAG-2, or Flag-BAG-3 were
analyzed by SDS-PAGE/immunoblot assay. As shown in Figure
10C, antiserum specific to Hsc70 detected the presence of
BAG proteins associated with Hsc70, whereas control immune-
complexes prepared with IgG1 as well as anti-Flag immune
complexes prepared from cells transfected with Flag-tagged
control proteins, Daxx and Apaf-1, did not contain Hsc70
associated protein. These results further demonstrate that
BAG-family proteins specifically bind to Hsc70.

C. BIAcore assay of BAG protein binding to the ATPase
domain of Hsc70

BAG-1 (beginning at residue 116 of SEQ ID NO:2)
is known to bind tightly to the ATPase domain of Hsc70
(Stuart et al., J. Biol. Chem., In Press (1998)). BAG-2
(SEQ ID NO:4) and BAG-3 (SEQ ID NO:6) proteins were,
therefore, examined for their ability to bind to
Hsc70/ATPase. The affinity and binding kinetics of BAG-2
(SEQ ID NO:4) and BAG-3 (SEQ ID NO:6) to Hsc70/ATPase were
also compared to those of BAG-1 (beginning at residue 116 of
SEQ ID NO:2) for Hsc70/ATPase, using a surface plasmon
resonance technique (BIAcore) which has been described
previously (Stuart et al., supra (1998), which is
incorporated herein by reference).



BAG-family proteins were produced in bacteria and
purified to near homogeneity as shown in Figure 12A and
described above in Example I. The purified BAG-1
(beginning at residue 116 of SEQ ID NO:2), -2 (SEQ ID
NO:4), and -3 (SEQ ID NO:6) proteins were then immobilized
on biosensor chips and tested for their interactions with
Hsc70 in the soluble phase. Kinetic measurements were
performed using a BIAcore-II instrument with CM5 sensor
chip and Amine Coupling Kit (Pharmacia Biosensor AB,
Sweden). Briefly, for immobilization of proteins, the
sensor chip was equilibrated with HK buffer (10 mM Hepes
(pH 7.4), 150 mM KCl) at 5 µl/min, then activated by
injecting 17 µl of 0.2M N-ethyl-N'-(3-dimethylaminopropyl)-
carbodiimide and 0.05M N-hydroxysuccinimide (NHS/EDC)
followed by 35 µl of the protein of interest, in 10 mM
acetate, pH 3.5-4.5. Excess NHS-ester on the surface was
deactivated with 17 µl 1M ethanolamine-HCl (pH 8.5). After
immobilization, 5 µl of regeneration buffer (50 mM phosphate
(pH 6.8) and 4M GuHCl) was injected. For binding assays,
Hsp70 (Sigma, H8778) was dissolved in HK buffer, and
injected at 10 µl/min across the prepared surface at
various concentrations. The surface was regenerated after
each injection with 5 µl of regeneration buffer. The rate
constants Kass and Kdiss were generated with BIAevaluation
software 3.01 (Pharmacia Biosensor AB). Addition of Hsc70
to chips containing BAG-1 (beginning at residue 116 of SEQ
ID NO:2), BAG-2 (SEQ ID NO:4) or BAG-3 (SEQ ID NO:6)
resulted in concentration-dependent binding, as reflected
by an increase in the Response Units (RU) measured at the
chip surface (shown in Figure 3B). In contrast, Hsc70
failed to display interactions in BIAcore assays with a
variety of control proteins as well as a mutant of BAG-1
lacking a C-terminal portion of the BAG domain which is
required for Hsc70-binding (Figure 3B). Furthermore,
flowing of various control proteins such as GST, BSA and
Bcl-XL over the BAG-1 (beginning at residue 116 of SEQ ID
NO:2), BAG-2 (SEQ ID NO:4), or BAG-3 (SEQ ID NO:6) chips
resulted in negligible interaction. These results further
demonstrate the specificity with which BAG-family proteins
interact with and bind to Hsc70.



The rates of Hsc70 binding to BAG-1 (beginning at
residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID NO:4), and BAG-3
(SEQ ID NO:6) proteins were similar, following pseudo
first-order kinetics with estimated association rate
constants (ka) of 2.1, 2.1 and 2.4 x 10^5 M^-1 sec^-1,
respectively. After allowing binding of Hsc70 to
immobilized BAG-1 (beginning at residue 116 of SEQ ID
NO:2), BAG-2 (SEQ ID NO:4), or BAG-3 (SEQ ID NO:6) to reach
plateau levels, the chaperone was removed from the flow
solution and the dissociation rate was monitored. BAG-1
(beginning at residue 116 of SEQ ID NO:2) and BAG-2 (SEQ ID
NO:4) exhibited similar dissociation rates, with relatively
slow loss of Hsc70 from the chip surface, resulting in
estimated dissociation rate constants (kd) of 3.0 and 5.0 x
10^-4 sec^-1, respectively (see Figure 3B). In contrast, Hsc70
dissociated more rapidly from biosensor chips containing
BAG-3 (see Figure 3B), yielding an estimated kd of 1.7 x 10^-3
sec^-1. From the kinetic data, the apparent affinities (KD
= kd/ka) were calculated for binding of Hsc70 to BAG-1
(beginning at residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID
NO:4), and BAG-3 (SEQ ID NO:6) and were estimated to equal
about KD = 1.4 nM, KD = 2.4 nM, and KD = 7.4 nM, respectively.
These results demonstrate that the interactions of BAG-family
proteins with Hsc70 occur with apparent affinities
sufficient for physiological relevance.
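
The relationship between the rate constants and the apparent affinity can be checked with a short calculation. The sketch below, given for illustration only, recomputes KD as the dissociation rate constant divided by the association rate constant, using the rounded values quoted above; the function and variable names are hypothetical. With these rounded inputs the BAG-1 and BAG-2 values come out at about 1.4 nM and 2.4 nM, while BAG-3 comes out at about 7.1 nM, slightly below the quoted figure of about 7.4 nM, a difference that may simply reflect rounding of the reported rate constants.

```python
import math

# (association rate in M^-1 sec^-1, dissociation rate in sec^-1),
# rounded values as quoted in the text.
RATE_CONSTANTS = {
    "BAG-1": (2.1e5, 3.0e-4),
    "BAG-2": (2.1e5, 5.0e-4),
    "BAG-3": (2.4e5, 1.7e-3),
}


def kd_nanomolar(k_assoc: float, k_dissoc: float) -> float:
    """Equilibrium dissociation constant in nM (k_dissoc / k_assoc)."""
    return k_dissoc / k_assoc * 1e9


def dissociation_half_life_s(k_dissoc: float) -> float:
    """Half-life of the complex for first-order dissociation, in seconds."""
    return math.log(2) / k_dissoc


def summarize(rate_constants=RATE_CONSTANTS):
    """Map each protein to (KD in nM, complex half-life in seconds)."""
    return {
        name: (kd_nanomolar(ka, kd), dissociation_half_life_s(kd))
        for name, (ka, kd) in rate_constants.items()
    }
```

The half-life column makes the faster release of Hsc70 from BAG-3 explicit: with the quoted constants it is several minutes for BAG-3 against tens of minutes for BAG-1 and BAG-2.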
EXAMPLE III

BAG-family proteins inhibit
Hsp70/Hsc70-dependent protein folding

This example demonstrates that BAG-2 (SEQ ID
NO:4) and BAG-3 (SEQ ID NO:6) proteins inhibit Hsp70/Hsc70-
dependent refolding of denatured proteins similarly to a
BAG-1 (beginning at residue 116 of SEQ ID NO:2) protein.

The effects of BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ
ID NO:6) protein on Hsp70/Hsc70-dependent protein refolding
were determined using in vitro protein refolding assays
similar to those described previously by Takayama et al.,
supra, 1998; Terada et al., J. Cell Biol., 139:1089-1095,
1997, which are incorporated herein by reference. Briefly,
luciferase (20 µM) was denatured in 25 mM Hepes-KOH, pH 7.2,
50 mM potassium acetate, 5 mM DTT, 6M guanidine
hydrochloride at ~25°C for 1 h. Denatured luciferase was
diluted 1:40 into 25 mM Hepes-KOH, pH 7.2, 50 mM potassium
acetate, 5 mM DTT. Hsc70 (1.8 µM), DnaJ (StressGen, Inc.)
(0.9 µM), and various purified recombinant proteins as
indicated were added to refolding buffer (30 mM Hepes-KOH,
pH 7.6, 120 mM potassium acetate, 3 mM magnesium acetate, 2
mM DTT, 2.5 mM ATP) with 0.2 volume of diluted denatured
luciferase to a final concentration of 0.1 µM. Luciferase
activity was measured after 1.5 hr incubation at 35°C.
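
The luciferase concentrations in this protocol follow from two successive dilutions. As a worked check, the sketch below assumes that "1:40" means a 40-fold dilution and that "0.2 volume" means the diluted luciferase makes up one fifth of the final reaction volume; both readings are assumptions, and the helper names are hypothetical.

```python
def dilute(concentration: float, fold: float) -> float:
    """Concentration after an n-fold dilution (same units as the input)."""
    return concentration / fold


def after_volume_fraction(concentration: float, fraction: float) -> float:
    """Concentration when a solution makes up `fraction` of the final volume."""
    return concentration * fraction


stock_luciferase_uM = 20.0   # denatured luciferase
stock_guanidine_M = 6.0      # guanidine hydrochloride in the denaturation mix

diluted_luciferase_uM = dilute(stock_luciferase_uM, 40)                    # 0.5 µM
final_luciferase_uM = after_volume_fraction(diluted_luciferase_uM, 0.2)   # 0.1 µM

# Residual denaturant carried into the refolding reaction, same two steps.
final_guanidine_M = after_volume_fraction(dilute(stock_guanidine_M, 40), 0.2)
```

Under these assumptions the calculation reproduces the stated final luciferase concentration of 0.1 µM, and the residual guanidine hydrochloride is about 0.03 M.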

The combination of Hsc70 and DnaJ resulted in
ATP-dependent refolding of chemically denatured firefly
luciferase, with function of over half the denatured enzyme
restored in a 90 minute reaction, as monitored by a
chemiluminescence assay. In contrast, neither Hsc70 nor
DnaJ alone was able to induce substantial refolding of
denatured luciferase. Furthermore, little spontaneous
restoration of luciferase activity was observed with
control proteins, BSA, GST or Bcl-XL (see Figure 4A).

Addition of recombinant purified BAG-1 (beginning
at residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID NO:4), or
BAG-3 (SEQ ID NO:6) to the above assays in amounts
equimolar to Hsc70 (1.8 µM) resulted in striking inhibition
of luciferase refolding. BAG-2 (SEQ ID NO:4) and BAG-3
(SEQ ID NO:6) displayed somewhat greater inhibitory
activity than BAG-1 (beginning at residue 116 of SEQ ID
NO:2) as shown in Figure 4A. In contrast, the BAG-1 (ΔC)
protein, which fails to bind Hsc70, as well as several other
control proteins, had no effect on luciferase refolding.

In an additional refolding assay (described
previously by Minami et al., J. Biol. Chem. 271:19617-24,
1996), purified Hsc70 and human DnaJ homolog Hdj-1 (Hsp40)
were used with additional cofactors provided in
reticulocyte lysates (5% v:v) to produce a system capable
of refolding denatured luciferase. Briefly, the
reactions included recombinant luciferase (Promega;
QuantiLum TM) that had been heat denatured at 42°C for 10
min, 1.8 µM Hsc70 (Sigma; purified from bovine brain), 0.9
µM Hsp40, and various recombinant purified proteins.
Luciferase activity was measured (Promega luciferase assay
kit) using a luminometer (EG&G Berthold, MicroLumat
luminometer, Model #LB96P). All results were normalized
relative to non-denatured luciferase that had been
subjected to the same conditions. Control reactions
lacking ATP, Hsc70, or Hsp40 resulted in negligible
luciferase refolding.
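
The normalization step can be expressed as a simple ratio. The following sketch is illustrative only: it assumes raw luminometer readings in relative light units, and the optional background subtraction is an added assumption that the text does not describe. The function name and the example readings are hypothetical.

```python
def percent_refolding(sample_rlu: float, native_rlu: float,
                      background_rlu: float = 0.0) -> float:
    """Luciferase activity as a percentage of the non-denatured control.

    sample_rlu:     reading from the refolding reaction
    native_rlu:     reading from non-denatured luciferase treated identically
    background_rlu: optional blank reading subtracted from both (assumption)
    """
    denominator = native_rlu - background_rlu
    if denominator <= 0:
        raise ValueError("native control must exceed background")
    return 100.0 * (sample_rlu - background_rlu) / denominator
```

With made-up readings of 500 for a refolding reaction and 1000 for the non-denatured control, the function returns 50.0, that is, half of the native activity recovered.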

Various amounts of purified BAG-1 (beginning at
residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID NO:4), or BAG-3
(SEQ ID NO:6), relative to amounts of Hsc70, were used in
the above-described protein refolding assay. Addition of
BAG-family proteins resulted in a concentration-dependent
inhibition of Hsc70 chaperone activity. Furthermore, the
BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ ID NO:6) inhibition of
Hsc70 chaperone activity was demonstrated to be as potent
as that observed for BAG-1 (beginning at residue 116 of SEQ
ID NO:2). In contrast, the BAG-1 (ΔC) mutant as well as
other control proteins did not suppress Hsc70-mediated
refolding of denatured luciferase. These results indicate
that BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ ID NO:6) can
inhibit Hsc70/Hsp70-dependent protein refolding activity to
the same extent as BAG-1 (beginning at residue 116 of SEQ
ID NO:2).
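One way to quantify a concentration-dependent inhibition of this kind is to express refolding in the presence of a BAG-family protein relative to refolding without it. The sketch below assumes that approach. Only the 1.8 µM Hsc70 concentration comes from the text; the molar ratios and refolding percentages are hypothetical placeholders, not measured data.

```python
def percent_inhibition(refolding_with_bag: float, refolding_without_bag: float) -> float:
    """Inhibition of refolding relative to the reaction lacking BAG protein."""
    if refolding_without_bag <= 0:
        raise ValueError("uninhibited refolding must be positive")
    return 100.0 * (1.0 - refolding_with_bag / refolding_without_bag)


HSC70_UM = 1.8  # Hsc70 concentration stated in the text

# Hypothetical series: (BAG:Hsc70 molar ratio, % refolding); illustration only.
series = [(0.0, 70.0), (0.5, 45.5), (1.0, 28.0), (2.0, 14.0)]
baseline = series[0][1]

for ratio, refolding in series:
    bag_um = ratio * HSC70_UM
    inhibition = percent_inhibition(refolding, baseline)
    print(f"{bag_um:.1f} uM BAG protein: {inhibition:.0f}% inhibition")
```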



BAG competes with Hip for binding to Hsc70.

It is known that BAG-1 competes with Hip for
binding to Hsc70, with these proteins exerting opposite
effects on Hsc70-mediated protein refolding (Hohfeld, J.,
and Jentsch, S., EMBO J., 16:6209-6216, 1997, which is
incorporated herein by reference). In order to determine
whether BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ ID NO:6) also
compete with Hip for binding to Hsc70, refolding assays
were performed as described above in the presence of Hip
protein.

Hip was purified as a His6-protein. The fusion
protein was induced from pET28-Hip (V. Prapapanich et al.,
Mol. Cell. Biol., 18:944-952, 1998, which is incorporated
herein by reference) with 0.1 mM IPTG at 25°C for 6 h in
BL21 cells. Cells from 1 L of culture were resuspended in
50 ml of 50 mM phosphate buffer (pH 6.8), 150 mM NaCl, and
1% (v/v) Tween-20 and then incubated with 0.5 mg/ml
lysozyme at 25°C for 0.5 h, followed by sonication. After
centrifugation at 27,500g for 10 min, the resulting
supernatant was mixed with 15 ml nickel resin (Qiagen,
Inc.) at 4°C for 3 h with 25 mM imidazole. The resin was
then washed with 50 mM phosphate buffer (pH 6.8), 25 mM
imidazole, 150 mM NaCl and 0.1% Tween-20 until the OD280nm
reached a value of <0.01. His6-Hip protein was eluted with
250 mM imidazole in washing buffer (Qiagen, Inc.) and
purified on Mono Q (HR10/10, Pharmacia) by FPLC using a
linear gradient of 0.5 M NaCl at pH 8.0, followed by
dialysis in chaperone assay buffer.
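The resuspension buffer quantities follow from simple dilution arithmetic. The sketch below works them out for the 50 ml volume given above. The stock concentrations (5 M NaCl, 1 M phosphate buffer) and the helper names are assumptions chosen for illustration; the text does not say how the buffer was actually prepared.

```python
def stock_volume_ml(final_conc: float, stock_conc: float, final_volume_ml: float) -> float:
    """Volume of stock needed so that C_stock * V_stock = C_final * V_final."""
    if stock_conc <= 0 or final_conc > stock_conc:
        raise ValueError("stock must be positive and at least as concentrated as the target")
    return final_conc * final_volume_ml / stock_conc


def mass_mg(conc_mg_per_ml: float, volume_ml: float) -> float:
    """Mass of solute for a given mass concentration and volume."""
    return conc_mg_per_ml * volume_ml


LYSIS_VOLUME_ML = 50.0  # resuspension volume stated in the text

# Stock concentrations below are assumptions, not taken from the text.
NACL_STOCK_MM = 5000.0
PHOSPHATE_STOCK_MM = 1000.0

print("NaCl stock (ml):", stock_volume_ml(150.0, NACL_STOCK_MM, LYSIS_VOLUME_ML))
print("Phosphate stock (ml):", stock_volume_ml(50.0, PHOSPHATE_STOCK_MM, LYSIS_VOLUME_ML))
print("Tween-20 at 1% v/v (ml):", 0.01 * LYSIS_VOLUME_ML)
print("Lysozyme at 0.5 mg/ml (mg):", mass_mg(0.5, LYSIS_VOLUME_ML))
```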

In the refolding assay reactions, addition of
purified Hip at equimolar concentrations relative to BAG-1
(beginning at residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID
NO:4), or BAG-3 (SEQ ID NO:6) (1.8 µM) completely negated
the inhibitory effects of the BAG-family proteins on
refolding of denatured luciferase (see Figure 4C). These
results demonstrate that the suppression of Hsc70 chaperone
activity by BAG-family proteins is reversible, and that Hip
antagonizes the effects of not only BAG-1 (beginning at
residue 116 of SEQ ID NO:2), but also of BAG-2 (SEQ ID
NO:4) and BAG-3 (SEQ ID NO:6).

In summary, these results demonstrate that
BAG-family proteins all contain a conserved BAG domain near
their C-terminus that binds Hsc70/Hsp70, and that human
BAG-family proteins can bind with high affinity to the
ATPase domain of Hsc70 and inhibit its chaperone activity
through a Hip-repressible mechanism.

EXAMPLE IV 

EXPANDED NUCLEIC ACID AND AMINO ACID SEQUENCES
FOR HUMAN BAG-3, BAG-4 AND BAG-5

Following the procedures disclosed herein, the
nucleic acid and amino acid sequences of human BAG-3,
BAG-4 and BAG-5 were further expanded. The expanded
sequences for BAG-3, BAG-4 and BAG-5 are shown in
Figures 15, 16 and 17, respectively, with their respective
sequence identification numbers ("SEQ ID NO"s).

We claim:



1. A compound of the formula,

$$R^{N}\text{-}R^{1}X^{1}R^{2}X^{2}R^{3}X^{3}R^{4}X^{4}R^{5}X^{5}R^{6}X^{6}R^{7}X^{7}X^{8}R^{9}X^{9}R^{10}X^{10}R^{11}X^{11}\text{-}R^{C}$$

wherein,

$R^{N}$ is a group of about 1 to 552 independently
selected amino acids;
$R^{1}$ is a group of 3 independently selected amino
acids;
$X^{1}$ is an amino acid with a charged or uncharged
R group;
$R^{2}$ is a group of 7 independently selected amino
acids;
$X^{2}$ is an amino acid with a charged R group;
$R^{3}$ is a group of 5 independently selected amino
acids;
$X^{3}$ is an amino acid with an apolar R group;
$R^{4}$ is a group of 3 independently selected amino
acids;
$X^{4}$ is an amino acid with a charged R group;
$R^{5}$ is a single independently selected amino acid;
$X^{5}$ is an amino acid with an apolar or uncharged R
group;
$R^{6}$ is a group of 15 independently selected amino
acids;
$X^{6}$ is an amino acid with a charged or uncharged
R group;
$R^{7}$ is a group of 2 independently selected amino
acids;
$X^{7}$ is an amino acid with a charged R group;
$X^{8}$ is an amino acid with a charged R group;
$R^{9}$ is a group of 2 independently selected amino
acids;
$X^{9}$ is an amino acid with an apolar R group;
$R^{10}$ is a group of 3 independently selected amino
acids;
$X^{10}$ is an amino acid with an uncharged R group;
$R^{11}$ is a group of 2 independently selected amino
acids;
$X^{11}$ is an amino acid with an apolar R group; and
$R^{C}$ is a group of about 1 to 100 independently
selected amino acids.
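The positional formula above can be read as a sequence pattern, and the following Python sketch encodes it as a regular expression. Several points are assumptions made for illustration only: the assignment of residues to the "charged", "apolar" and "uncharged" classes is a common textbook grouping that this section does not define, "about 1 to 552" and "about 1 to 100" are treated as exact bounds, and the test sequence is synthetic rather than a real BAG-family protein.

```python
import re

# Assumed residue classes (not defined in this section).
CHARGED = "DEKRH"
APOLAR = "AVLIMFWPG"
UNCHARGED = "STNQYC"
ANY = "ACDEFGHIKLMNPQRSTVWY"


def one_of(*classes: str) -> str:
    """Regex character class matching one residue from the given classes."""
    return "[" + "".join(classes) + "]"


def any_run(n: int) -> str:
    """Regex for n independently selected amino acids."""
    return f"[{ANY}]{{{n}}}"


# R1 X1 R2 X2 R3 X3 R4 X4 R5 X5 R6 X6 R7 X7 X8 R9 X9 R10 X10 R11 X11
CORE = [
    any_run(3), one_of(CHARGED, UNCHARGED),
    any_run(7), one_of(CHARGED),
    any_run(5), one_of(APOLAR),
    any_run(3), one_of(CHARGED),
    any_run(1), one_of(APOLAR, UNCHARGED),
    any_run(15), one_of(CHARGED, UNCHARGED),
    any_run(2), one_of(CHARGED), one_of(CHARGED),
    any_run(2), one_of(APOLAR),
    any_run(3), one_of(UNCHARGED),
    any_run(2), one_of(APOLAR),
]
PATTERN = re.compile(f"[{ANY}]{{1,552}}" + "".join(CORE) + f"[{ANY}]{{1,100}}")


def matches_formula(sequence: str) -> bool:
    """True if the whole sequence fits the R/X positional formula."""
    return PATTERN.fullmatch(sequence.upper()) is not None


# Synthetic sequence: one residue for R^N, the 54-residue core, one for R^C.
core = ("AAA" + "K" + "A" * 7 + "E" + "A" * 5 + "L" + "AAA" + "D" + "A" + "V"
        + "A" * 15 + "S" + "AA" + "K" + "R" + "AA" + "I" + "AAA" + "T" + "AA" + "F")
candidate = "M" + core + "G"
print(matches_formula(candidate))
```

Changing a constrained position to a residue outside its class (for example the X2 position, index 12 in `candidate`, from `E` to `A`) makes the match fail, which is a quick way to check that a position is being enforced.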

2. A substantially purified nucleic acid
molecule having a nucleotide sequence corresponding to or
complementary to at least 20 nucleotides from a nucleotide
sequence selected from the group consisting of (SEQ ID
NO:1), (SEQ ID NO:3), (SEQ ID NO:5), (SEQ ID NO:7), (SEQ ID
NO:9), (SEQ ID NO:19), (SEQ ID NO:21) and (SEQ ID NO:23).
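The "corresponding to or complementary to at least 20 nucleotides" language can be illustrated computationally. The sketch below takes one possible reading, namely exact identity over 20 contiguous nucleotides on either strand. That reading is an assumption made for illustration and not a construction of the claim. The reference string is a made-up toy sequence, not one of the listed SEQ ID NOs.

```python
def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (A<->T, C<->G)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def shares_window(query: str, reference: str, k: int = 20) -> bool:
    """True if query, or its reverse complement, shares an exact k-mer with reference."""
    query, reference = query.upper(), reference.upper()
    ref_kmers = {reference[i:i + k] for i in range(len(reference) - k + 1)}
    for strand in (query, reverse_complement(query)):
        for i in range(len(strand) - k + 1):
            if strand[i:i + k] in ref_kmers:
                return True
    return False


# Toy reference sequence, hypothetical and for illustration only.
reference = "ATGGCTAGCTAGGATCCGATTACGCTAGCTAGGCTAACGTTAGC"
sense_probe = reference[5:25]
antisense_probe = reverse_complement(sense_probe)

print(shares_window(sense_probe, reference))
print(shares_window(antisense_probe, reference))
print(shares_window("A" * 20, reference))
```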

3. The nucleic acid of claim 2 having a
nucleotide sequence corresponding to or complementary to a
nucleotide sequence that encodes a functionally active BAG
family protein selected from the group consisting of (SEQ
ID NO:2), (SEQ ID NO:4), (SEQ ID NO:6), (SEQ ID NO:8), (SEQ
ID NO:10), (SEQ ID NO:20), (SEQ ID NO:22) and (SEQ ID
NO:24).

4. The nucleic acid of claim 3 selected from
the group consisting of (SEQ ID NO:1), (SEQ ID NO:3), (SEQ
ID NO:5), (SEQ ID NO:7), (SEQ ID NO:9), (SEQ ID NO:19),
(SEQ ID NO:21) and (SEQ ID NO:23).

5. The nucleic acid of claim 3 complementary to
a nucleotide sequence that encodes a functionally active
BAG protein selected from the group consisting of (SEQ ID
NO:2), (SEQ ID NO:4), (SEQ ID NO:6), (SEQ ID NO:8), (SEQ ID
NO:10), (SEQ ID NO:20), (SEQ ID NO:22) and (SEQ ID NO:24).



6. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:3).

7. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:5).

8. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:7).

9. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:9).

10. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:19).

11. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:21).

12. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:23).

13. A substantially purified BAG family protein
encoded by the nucleic acid molecule of claim 1.

14. A substantially purified BAG family protein
comprising the amino acid sequence selected from the
group consisting of (SEQ ID NO:2), (SEQ ID NO:4), (SEQ ID
NO:6), (SEQ ID NO:8), (SEQ ID NO:10), (SEQ ID NO:20), (SEQ
ID NO:22) and (SEQ ID NO:24) or a fragment, a derivative or
a mimetic thereof.

15. A substantially purified protein
corresponding to the amino acid sequence of 157 to 204 of
(SEQ ID NO:2).

16. A substantially purified protein
corresponding to the amino acid sequence of 272 to 319 of
(SEQ ID NO:2).

17. A substantially purified protein
corresponding to the amino acid sequence of 164 to 211 of
(SEQ ID NO:4).

18. A substantially purified protein
corresponding to the amino acid sequence of 418 to 510 of
(SEQ ID NO:20).

19. A substantially purified protein
corresponding to the amino acid sequence of 378 to 457 of
(SEQ ID NO:22).

20. A substantially purified protein
corresponding to the amino acid sequence of 6 to 97 of (SEQ
ID NO:24).

21. A substantially purified protein
corresponding to the amino acid sequence of 180 to 257 of
(SEQ ID NO:24).

22. A substantially purified protein
corresponding to the amino acid sequence of 272 to 349 of
(SEQ ID NO:24).

23. A substantially purified protein
corresponding to the amino acid sequence of 362 to 444 of
(SEQ ID NO:24).

24. A pharmaceutical composition comprising a
nucleic acid molecule of claim 1 useful for modulating
tumor cell proliferation, cell migration and metastasis,
and steroid hormone receptor function.

25. A method of modulating tumor cell
proliferation, cell migration and metastasis, and steroid
hormone receptor function by administering a nucleic acid
molecule of claim 1.

26. A pharmaceutical composition comprising a
substantially purified BAG family protein comprising the
amino acid sequence selected from the group consisting of
(SEQ ID NO:2), (SEQ ID NO:4), (SEQ ID NO:6), (SEQ ID NO:8),
(SEQ ID NO:10), (SEQ ID NO:20), (SEQ ID NO:22) and (SEQ ID
NO:24), or a fragment, a derivative or a mimetic thereof,
useful for modulating tumor cell proliferation, cell
migration and metastasis, and steroid hormone receptor
function.

27. A method of modulating tumor cell
proliferation by administering a pharmaceutical composition
of claim 26.

28. A method of modulating cell migration and
metastasis by administering a pharmaceutical composition of
claim 26.

29. A method of modulating steroid hormone
receptor function by administering a pharmaceutical
composition of claim 26.

30. A substantially purified antibody that
specifically binds to a BAG family protein of claim 14.

31. The antibody of claim 30, wherein said
antibody is a monoclonal antibody.

32. A method for detecting the presence of a BAG
family protein in a sample, comprising the steps of:

a. obtaining the sample;

b. adding to said sample an antibody of claim 11
under suitable conditions for the
binding of said antibody with the BAG
family protein; and

c. detecting said bound BAG family
protein.

33. A method for detecting the presence of a
first nucleic acid molecule that encodes a BAG family
protein in a sample, comprising the steps of:

a. obtaining the sample;

b. adding to said sample a second nucleic
acid molecule capable of hybridizing
with said first nucleic acid molecule
under suitable conditions for the
binding of said second nucleic acid
molecule with said first nucleic acid
molecule; and

c. detecting said hybridized first and
second nucleic acid molecules.

34. A method of determining the risk of
metastatic spread of cancer or prognosis of cancer patients
by determining the level of expression of a BAG-family
protein.
FIGURE 2A

FIGURE 2B



WO 00/14106 






PCT/US99/21053 



FIGURE 3 

gcgoagctcc ccnrcavcc ccgggccgcg accmcrrcx crtjcacTCGR cc«Gr«Grrr ctagccggcc rgttcctrcc jccctttrtc ga 

R £ L R IQP RflR R H F S GLO Q K F LRCQ LLP P F I 

TtXTTCCTTCC CCTCTGCCRG CO^GGfKXXT fiTTfCCrtGnC RCT7CCRCCC CTCTCTOGCC RCGTCRCCCC CGCCTTTRRT TCATRRRGGT t80 
SSFP SGS £ E R ( S R H F H P SLR TSPP PL( H K G 

CCCCCOCCCC GCCTTCCCGG RCttCGTCGGC GGCGORQRGG GGCCCRC&3C GGCGGCCCGG OCflGftGftCTC GCOGCCCGGR GCCRGCGCCC 270 
RRRR LPG HUG G G E G PTR RRR P E T R RPE PRP 

CXX^CCCCCG CCCCRGCX5GG CRCRCCCCAR CXXAQCRTGR GCGCCCCCftC CCRCTCGCCC RTGRTGCRGG TX5GCGTCCGG CRRCGGTGRC 3C0 
RTRR PRG RPQ PSttS RRT HSP ft ft Q V RSC «GO 

CSCGRCCCTT TGCCCCCCGG RTGGGRGRTC R^GRTCORCC CGCRGRCCGG CTGGCCCTTC TTCGTGGACC RGRRCRGCCG CRCCRCTRCO 450 
ROPL PPG U E ( KIOP QTG UPF FUOH «SR TTT 

TGOVCCACC CGCCCGTGCC CTCTGRGGCC CCCPAGGttGR CTCCRTCCTC TGCCRRTGGC CCTTCCCGGG RGGCCTCTAG GCTGCCGCCT 540 
UMOP R U P SEC PKET PSS fl M G PSRE GSR LPP 

GCTRGGGRRO GCCRCCC 7 G T G7RCCCCCRG CICCGftCCAG GCTRCRTTCC CRTTCCTGTG CTCCRTGRRG GCGCTGRGRfl CCGGCRGGTG 630 
RREG H P U YPQ LRPG VtP 1 P U LHEG REM R Q U 

CRCCCTT7CC RTGTCTRTCC CCnGCCTGGC RTGCRGCGAT TCCCARCTGA GGCCGCROCR GCGGCTCCTC RGRGGTCCCfl GTCRCCTCTG 720 
H P F H UVP QPG M Q R F RTE RRR fl R P Q RSQ SPL 

CGGGGCRTGC CRGRRRCX^RC TCRGCCRGRT RRRCRGTGTG GRORGGTGGC RGCGGOGGCG GCf¥3CCCfYX CCCCHGCCTC CCBCGGRCCT 8 JO 
ROtlP ETT QP0 KQCG QUR RRR RRQP PRS HGP 

GRGCGGTCCC RGTCTCCRGC TGCCTCTGRC TGCTCRTCCT CRTCCTCCTC GGCCRGCCTG CCTTCCTCCG GCRGGRGCRG CCTGGGCRGT «00 
ERSQ SPR RSD CSSS SSS RSL PSSC RSS LGS 

CRCCRGCTCG CGCGCGGGTR CRTCTCCRTT CCGGTGRTAC RCGRGCRGf*l CCTTRCCCGG CCTCCRGCCC RCCCCTCCTT CCRCRRRGCC C*50 
HQLP RGY ISt P U 1 H E Q ti U T R PRRQ PSF H K fl 

CRGR^GRCCC RCTRCCCRGC GCRGRGGGGT CRGTRCCRGR CCCRCCRGCC TGTGTRCCRC RRGRTCCRGG GGGRTGRCTG GGRGCCCCGG 1O80 
QKTH VPR QRG EYQT HQP UVH KIQG DDK £PR 

CCCCTGCGGG CGGCRTCCCC GTTCRGGTCR 7CTGTCCRCG GTGCRTCGRG CCGGGRGGGC TC«CCRGCCR GGRGCRGCRC GCCflCTCC&C 1 170 
PLRR RSP FRS S U Q G RSS REG SPAR SST PLH 

TCCCCCTCGC CCRTCCGTG7 GCRCRCCGTG OTCGRCOGGC CTCRGCRGCC CRTGRCCCRT CCRGRRRCTG CRCCTGTTTC CCRGCCTGRR 1260 
SPSP IRU HTU UORP Q G P n T H RETR PUS QPE 

RRCRRRCCRG RRRGTRRGCC RCJGCCCRGT7 GGRCCRGRRC TCCCTCCTGG RCfiCRTCCCFi RTTCRftGTGfl TCCGCRRRGR GGTGGRTTCT 1350 
N K P E SKP GPU GPEL PPG HIP I Q U I RKE UDS 

RRRCCTGTT7 CCCRGRRGCC CtXRCCTCCC TCTG*GFWGG TflGRGGTGRfl RGTTCCCCCT GCTCCRGTTC CTTGTCCTCC TCCCRGCCCT 1440 
K P U S OKP PPP SEKU EUK UPP RPUP CPP PSP 



GGCCCTTCTG CTGTCXCCTC TTCCCCCfVK RGTGTGOCTR CRGRAGRGAG GGCRGCXXCC RGCftCTGCXX CTGCRGORGC TRCRCCTCCA 1S30 

GPSR UPS SPK SURT EER RRP STRP RER TPP 

RfVCCRGGRG RRGCCGRGGC TCOXCfWW CRTCC«GORG TGCTGRRRGT GGRRCCCflTC CTGGRGRRGG TGCRGGGGCT GGRGCRGGCT 1620 

KPGE RER P P K HPGU LKU ERI L E K U QGL EQR 

GTRGRCRRCT TTGRRGGCRfl GRRGRCTGRC RARF^GTRCC TGRTGRTCGR RGRGTRTTTG RCCfWlGRGC TGCTGGCCCT CGRTTCRGTG 1710 

U0«F EGK K T O K K V L tt I E EYL TKEL LRL OSU 

GRCCCCCRGG GRCGRGCCGR TGTGCGTCRG GCCRGGRORC RCCGTGTCRG GRRGGTTCRG RCCRTCTTGG RRRRRCTTGR RC3KJRRAGCC 18O0 

OPEG RRD URQ RRRO GUR K U Q TILE KLE Q K fl 

RTTGRTGTCC CRGGTCRRGT CCRGGTCTRT GRRCTCCRGC CCRGCRRCCT TGRRGCRGRT O^GOCRCTGC RGGCRRTCRT GGRGRTGGGT 1890 

IDUP GQU QUY ELQP SHL ERO QPLQ Rill E fl G 

GCCGTGGCRG CRGRCRRGCG CROGRRf¥^RT CCTGORWTG CRGRRCRTCC CCRCRCRGRn RCCCRGCRGC C^RRGCCRC RGCRGCRGCG 1«80 

RUfiR OKG KKH RGMR EOP HTE TQQP ERT RRR 

RCTTCRRACC CX^GCOGCRT GRiC«GaC«CC CCTGGTROCC C^GOXX^CC GTRGCCTCTG CCCTGTRRRR GTCRGRCTCG GRACCGRTGT 2070 

TSHP SSM TOT PGMP RRP 

GTGCTTTRCG GRTTTTRGTT GCRTGCRTTT O^C^G^CTTT RGGTCRGTTG GTTTTGRTTR GCTGCTTGGT RTGCROTRCT TGGGTGRGGC 2160 

ROnCRCTRTR ROGGCCTRRR RGGGPrfWTG RTGCTTTTCT TCRRTRTTCT TRCTCTTGTR C^RTTRRMGR ROTTGCTTGT TGTTTGRGRR 2250 

GTTTRROCCC GTTGCTTGTT CTGCRGCCCT GTCMRCTTGO GCRCCGXXRC CRCCTGTTAC CT GTGGTTGT GCRCTGTCTT TTGTRGCTCT 2340 

O0RCTGORO0 GGTRGRTGGG GROTCRRTTR CCCRTCnCRT RTYUATGRRR CRTTTRTCRO RORTOTTGCC RTTTTRRTOR CRTGRTTTTC 2430 

TTCRTCTCRT RRTTRRRRTfl CCTGOCTTTR ORC^GOOTRn RflTGTGCCRG GRGCCRTROO RRTRTCTOTR T0TT0ORT0R CTTTRRTOCT 2520 
RCRTTTTH -2528 
FIGURE 4



FIGURE 5






FIGURE 6A 



ATGTCTTTCCGCCTCTTCGTTGAAA 
GCTTTGGTTTTT 

CGAGAAAACCACGTTCCAA/^TCAGCGAC^TCT 
CTCAAATTATG 

CTTCTCATATTGCATG AG CATTTTG AAG CCCG CXSTCATCAACCAAAGCATTTTTTCGACCC^TOV 
CAATG ATTTT AT CATTTTCTTTAAAATT 
FIGURE 6B

MKVNVSCSSV QTTIDILEEN QGEDESILTL GQLRDRIATD KDVDVETMKL 50
LKRGKFLQGA DDVSLSTLNF KENDKIIVMG GKNALVDDAG FKMLMQYEKH 100
NLSNLQKAYD LNLRDVADLE RGFLEKPKQV EMGKKLEKKV KYFNEEAERK 150
LETLDGMNII TETTPENQAX RNREKRKTLV NGIQTLLNQN DALLRRIX)EY 200
QSVLNGDIPE 210
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ATGCCAGTCG TGAACATACC 
TAGTCGAAGT AACTCCTCGT 
CACAGCAGCC ACCTCAACCG 
CAGCAGGCTC CAAACGTGAA 
ACCTAACTTC CCATCTCGTA 
CTGGGTTCCC AAACGATTCT 
AATTTCCCAA GTGGATTCTC 
AAGATTCGGA AGAGATGGAG 
ACAGGAGAAG TCCAACACCA 
AGACGCAACT CTCAGCAGAA 
ACCACAACAA GCTCAACAAC 
CATCTCGACC ACCATCTCGT 
GAGAGACCAG CAGTTATTCC 
GGAGAAGAAA GGTAGTCGTG 
AG AACATTC C CAAGATCACG 
GAACAAGAAA CGGACGGCGA 
AGGAAAGCCA AAGAGAGGAA 
ATTTCAATGC CAAGACAATT 
GAGCAGTTGA GAAAAAAAGC 
TCTTCGTTCT CTAGGAGAAA 
AAGAATGTGA TCGTGAAGAG 
AGAACAAAGA CAGTTCAAGT 
GAAAAAAGCA CTGGAAGATG 
TGATGCATTC GAATATTGAA 
AACGCCTGTT CGTACGAAGA 
GAAGATCATA ATTCAGTGCG 
GTCTGGAAAA TCTGATGTCT 
GATTTGATGG ATGATCAAAG 



FIGURE 7A 

AATCAAAATA CTTGGTCAGA 
CTTCTGTTGA CAACGATCGA 
CAACCACAAC AGCAATCTCA 
TACCAATATG CATGATTCCA 
GTCCTATTCC GGACTTTCCC 
GAATGGTCTT CGAATTTCCC 
AAATGGAAGT TCTAATTTCC 
GACTATCGCC AAACCCACCG 
ACATCAACTC AATCTCCAAC 
TCAAGCTCCT CCACAATATT 
GTCAGACAAC TCCTCCGTCA 
ACTCGTGAAC CAAAGGAACC 
ATTGCCATAT GAGAAGAAGG 
ATTCTGGAAA GGGTGATGAG 
ATCGGAAAGA ATAATTGCGA 
CCCATCTCCA CTAACCTCCC 
AGAAACTTCA ACGTAATCAA 
GTTACTTTGG ATAAAATTGA 
TGCTGAACTC GAAATGGAAA 
TCAGTGTTCA TAACTGCATG 
ATTGAAGCAA TCACTGACCG 
TGTTGTCGAA ACTCCACGAA 
CAACTTTGAT GATCGATGAA 
AAGGCTAAGC TGTGCCTACA 
AACTGCTGGA GCCACCTGCC 
CTGCTGATGA TCAGAAACGC 
CAAATTGAGA ATGCTGAGAG 
CGAATAG 
FIGURE 7B

MPVVNIPIKI LGQNQSHSRS NSSSSVDNDR NQPPQQPPQP QPQQQSQQQY 50
QQAPNVNTNM KHSNGFSPNF PSRSPIPDFP SFSSGFPNDS EWSSKFPSFP 100
NFPSGFSNGS SNFPDFPRFG RDGGLSPNPP MQGYRRSPTP TSTQSPTSTL 150
RRNSQQNQAP PQYSQQQPQQ AQQRQTTPPS TKASSRPPSR TREPKEPEVP 200
ERPAVIPLPY EKKEKPLEKK GSRDSGKGDE NXjEENIAKIT IGKNNCELCP 250
EQETDGDPSP LTSPITEGKP KRGKKLQRNQ SVVDFNAKTI VTLDKIELQV 300
EQLRKKAAEL EMEKEQILRS LGEISVHNCM FKLEECDREE IEAITDRX.TK 350
RTKTVQVVVE TPRNEEQKKA LEDATLMIDE VGEMMHSNIE KAKLCLQTYM 400
NACSYEETAG ATCQNFLKII IQCAADDQKR IKRRLENLMS QIENAERTKA 450
DLMDDQSE 458
FIGURE 8A

ATGTCAGAAA AGACTAGCAC AGTTACAATA CACTATOGAA ATCAGCGATT 50
TCCGGTAGCA GTCAATCTAA ATGAGACGTT AAGTGAACTG ATTGATGATT 100
TACTTGAAAC GACTGAGATT TCTGAGAAGA AAGTCAAGCT TTTTTACGCT 150
GGCAAGCGTT TAAAAGACAA AAAAGCCTCG TTATCAAAAT TGGGTTTAAA 200
AAATCATAGT AAAATTCTAT GTATAAGACC ACATAAGCAA CAACGAGGTT 250
CCAAGGAAAA AGACACGGTT GAGCCCGCTC CGAAAGCGGA AGOGGAGAAT 300
FIGURE 8B
FIGURE 9B
FIGURE 13
FIGURE 15A
FIGURE 15B
FIGURE 15C
FIGURE 16A
FIGURE 16B






FIGURE 16C 
FIGURE 17A
FIGURE 17B
CCCAOCACCC CTTCCTACCT TTTCTACCCA CTCAOCTCTT AAOOwCTCAC TTACBAAATC TCATAACTAC AOCTOCTOCA ©CAOCAATCA «TOO 

CTCAAAOCTC CTCCCAATTA AACCCTTOCT CTC O CT C CCT CCTCAACAAC TCACCTCATC TCAfCOCCAT CT9CTCCTTT CTCTCTTCOC J7tO 

TOAAACACQC ATTAAACTCA CTOCTCCCTC AAOCATCTCT CTTOP^AAOC A fCTCWT Tl CCCTA AA HC C TTTCTCAOOA TOCOCTAOAA 3««0 

AATCATTTCC CAAAC 1 |C 11 AACTCOCTTC H CA AjO WCTC CCTOCXSACTC TTRTAAQACA CTOCTOOCCC 45CCTCAXXTT TTOCCTCCTC »T70 

ATOCTCTmC ACTACATCTT TOQAAATOCA fc OCHA A T A TT CTCTTTCAOA •ATPCTQATT CTCTAACTCT «T*TAO0OAC ATACTOACTT 30*0 

H WACACCTC AAACTAOem «C1MMCCA6 BACCCT1AAA ^OCTAAAAC ATCAOAOAAC OATBACOOCTT AAOAAATCTT OAOCTTCTCT 3KO 

CT O A CW TTI OCCAATCATC AAAATOOCTT B CmT CKT T VXrwCTACA AATCTTCTAC AATCOCTOOA OOCTCCCCTC AOAOOOOACT 33 10 

OCCTATTWC COOCTCACCT TCCOATACm CTOCA6CTCC AACTOCTWC AACCOCCCOC TCAACCCACC TOATTCCAAT CCAOCTTTTC 3330 

CCOACACTAC TTCTACCATT CCTTTTCTCT WTOATAATTT TACAAT«CTC TTAAAATCTT CACCAACACT TTTWTTTTT WTTWTTTT 3<2C 

TCACATCCAC T VT. 3 V 1I VC CC5ACCCTCCA CTCCACTCCT ©CQmTTCAC CTCACTXAA CCTCCAOCTC CKMCCTTCAA •COATTCTOC 3*10 

TCCCTOAOCC AOCTBACmC CTCCCACTAC ACCOATCTC4 CACOATCCCT COCWATTTT TV. mill I'm A-mCACTTOA **TT7CAOCA 3400 

tCATCCTCAC CCTCCTCTOC AACTCCTCAC CTOCTCATOC CCCCOOCTOC 0CCO0CCAAA CTCCTCCCAT TAAOC«T=T «AOOCAOCCC »4»0 

COCCACCCCA •CAACACTTT TTAAATTACA B CTC rCTTBA ATWTACCAC TCCCAAAT C A TOCTTACOCT TOACCOATAT TCTTCOOCAC 37*0 

ACTACTACTT ACATTTTAAA H lVATTTTC TAAA CTTA AA TCTCACCATT OCCTTCAAAA CTCTCCATTC TTCTTTCAAA •TtACACCTTT 3I7« 

CA CTCATTCT TTTCAAAOAA CTCTTTCTCI ACCTTTTCOC AACCTCTCCC OATCCTCTCT CACTACACC* TtCTCACCTC TTOCACC<TC 3t40 

ATTTTCAATT CTTCACATOC C T A A ITCC K ATCCAAATCA TCAOATTCAC CTTWOTCAC TCTCACCCAT OCCTTTCTTT CDACTTTCAA 4OS0 

1V. TT V T C>. TTCCTTCTAC OCXATTATTC TACTCCTCOA AT t A AOCCTC TTCACACC*0 ATTTACCTCT TCTCC«OCTr OCTOCACACC 4140 

TCTtT C r C TT AATRTCACCT ACTCCATCTA ATTCTTAAAC T&CCCTKTC ACA17MHIT CTATTTTTCT CATCTCTAAT CAAAACAATC <2>0 

TCTMCT&CAA CT5MAACCTA CTCOGC**AA ATCTCTCCCT TT*C«TCT0C ATTAAACCCT CtACTCCATC TTOAJVOC 4304 
FIGURE 18
<140>
<141>
<170> PatentIn Ver. 2.0

<210> 1
<211> 1291
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (46)..(1080)

<400> 1

acgccgcgct cagcttccat cgctgggcgg tcaacaagtg cgggc ctg gct cag cgc 57
Leu Ala Gln Arg
1

ggg ggg gcg cgg aga ccg cga ggc gac cgg gag cgg ctg ggt tcc cgg 105
Gly Gly Ala Arg Arg Pro Arg Gly Asp Arg Glu Arg Leu Gly Ser Arg
5 10 15 20

ctg cgc gcc ctt cgg cca ggc cgg gag ccg cgc cag tcg gag ccc ccg 153
Leu Arg Ala Leu Arg Pro Gly Arg Glu Pro Arg Gln Ser Glu Pro Pro
25 30 35

gcc cag cgt ggt ccg cct ccc tct cgg cgt cca cct gcc cgg agt act 201
Ala Gln Arg Gly Pro Pro Pro Ser Arg Arg Pro Pro Ala Arg Ser Thr
40 45 50

gcc agc ggg cat gac cga ccc acc agg ggc gcc gcc gcc ggc gct cgc 249
Ala Ser Gly His Asp Arg Pro Thr Arg Gly Ala Ala Ala Gly Ala Arg
55 60 65

agg ccg cgg atg aag aag aaa acc cgg cgc cgc tcg acc cgg agc gag 297
Arg Pro Arg Met Lys Lys Lys Thr Arg Arg Arg Ser Thr Arg Ser Glu
70 75 80

gag ttg acc cgg agc gag gag ttg acc ctg agt gag gaa gcg acc tgg 345
Glu Leu Thr Arg Ser Glu Glu Leu Thr Leu Ser Glu Glu Ala Thr Trp
85 90 95 100

agt gaa gag gcg acc cag agt gag gag gcg acc cag ggc gaa gag atg 393
Ser Glu Glu Ala Thr Gln Ser Glu Glu Ala Thr Gln Gly Glu Glu Met
105 110 115

aat cgg agc cag gag gtg acc cgg gac gag gag tcg acc cgg agc gag 441
Asn Arg Ser Gln Glu Val Thr Arg Asp Glu Glu Ser Thr Arg Ser Glu
120 125 130

gag gtg acc agg gag gaa atg gcg gca gct ggg ctc acc gtg act gtc 489
Glu Val Thr Arg Glu Glu Met Ala Ala Ala Gly Leu Thr Val Thr Val
135 140 145

acc cac agc aat gag aag cac gac ctt cat gtt acc tcc cag cag ggc 537
Thr His Ser Asn Glu Lys His Asp Leu His Val Thr Ser Gln Gln Gly
150 155 160

agc agt gaa cca gtt gtc caa gac ctg gcc cag gtt gtt gaa gag gtc 585
Ser Ser Glu Pro Val Val Gln Asp Leu Ala Gln Val Val Glu Glu Val
165 170 175 180

ata ggg gtt cca cag tct ttt cag aaa ctc ata ttt aag gga aaa tct 633
Ile Gly Val Pro Gln Ser Phe Gln Lys Leu Ile Phe Lys Gly Lys Ser
185 190 195

ctg aag gaa atg gaa aca ccg ttg tca gca ctt gga ata caa gat ggt 681
Leu Lys Glu Met Glu Thr Pro Leu Ser Ala Leu Gly Ile Gln Asp Gly
200 205 210

tgc cgg gtc atg tta att ggg aaa aag aac agt cca cag gaa gag gtt 729
Cys Arg Val Met Leu Ile Gly Lys Lys Asn Ser Pro Gln Glu Glu Val
215 220 225

gaa cta aag aag ttg aaa cat ttg gag aag tct gtg gag aag ata gct 777
Glu Leu Lys Lys Leu Lys His Leu Glu Lys Ser Val Glu Lys Ile Ala
230 235 240

gac cag ctg gaa gag ttg aat aaa gag ctt act gga atc cag cag ggt 825
Asp Gln Leu Glu Glu Leu Asn Lys Glu Leu Thr Gly Ile Gln Gln Gly
245 250 255 260

ttt ctg ccc aag gat ttg caa gct gaa gct ctc tgc aaa ctt gat agg 873
Phe Leu Pro Lys Asp Leu Gln Ala Glu Ala Leu Cys Lys Leu Asp Arg
265 270 275

aga gta aaa gcc aca ata gag cag ttt atg aag atc ttg gag gag att 921
Arg Val Lys Ala Thr Ile Glu Gln Phe Met Lys Ile Leu Glu Glu Ile
280 285 290

gac aca ctg atc ctg cca gaa aat ttc aaa gac agt aga ttg aaa agg 969
Asp Thr Leu Ile Leu Pro Glu Asn Phe Lys Asp Ser Arg Leu Lys Arg
295 300 305

aaa ggc ttg gta aaa aag gtt cag gca ttc cta gcc gag tgt gac aca 1017
Lys Gly Leu Val Lys Lys Val Gln Ala Phe Leu Ala Glu Cys Asp Thr
310 315 320

gtg gag cag aac atc tgc cag gag act gag cgg ctg cag tct aca aac 1065
Val Glu Gln Asn Ile Cys Gln Glu Thr Glu Arg Leu Gln Ser Thr Asn
325 330 335 340

ttt gcc ctg gcc gag tgaggtgtag cagaaaaagg ctgtgctgcc ctgaagaatg 1120
Phe Ala Leu Ala Glu
345

gcgccaccag ctctgccgtc tctggatcgg aatttacctg atttcttcag ggctgctggg 1180
ggcaactggc catttgccaa ttttcctact ctcacactgg ttctcaatga aaaatagtgt 1240
ctttgtgatt tgagtaaagc tcctattctg tttttcacaa aaaaaaaaaa a 1291



<210> 2
<211> 345
<212> PRT
<213> Homo sapiens

<400> 2

Leu Ala Gln Arg Gly Gly Ala Arg Arg Pro Arg Gly Asp Arg Glu Arg
1 5 10 15

Leu Gly Ser Arg Leu Arg Ala Leu Arg Pro Gly Arg Glu Pro Arg Gln
20 25 30

Ser Glu Pro Pro Ala Gln Arg Gly Pro Pro Pro Ser Arg Arg Pro Pro
35 40 45

Ala Arg Ser Thr Ala Ser Gly His Asp Arg Pro Thr Arg Gly Ala Ala
50 55 60

Ala Gly Ala Arg Arg Pro Arg Met Lys Lys Lys Thr Arg Arg Arg Ser
65 70 75 80

Thr Arg Ser Glu Glu Leu Thr Arg Ser Glu Glu Leu Thr Leu Ser Glu
85 90 95

Glu Ala Thr Trp Ser Glu Glu Ala Thr Gln Ser Glu Glu Ala Thr Gln
100 105 110

Gly Glu Glu Met Asn Arg Ser Gln Glu Val Thr Arg Asp Glu Glu Ser
115 120 125

Thr Arg Ser Glu Glu Val Thr Arg Glu Glu Met Ala Ala Ala Gly Leu
130 135 140

Thr Val Thr Val Thr His Ser Asn Glu Lys His Asp Leu His Val Thr
145 150 155 160

Ser Gln Gln Gly Ser Ser Glu Pro Val Val Gln Asp Leu Ala Gln Val
165 170 175

Val Glu Glu Val Ile Gly Val Pro Gln Ser Phe Gln Lys Leu Ile Phe
180 185 190

Lys Gly Lys Ser Leu Lys Glu Met Glu Thr Pro Leu Ser Ala Leu Gly
195 200 205

Ile Gln Asp Gly Cys Arg Val Met Leu Ile Gly Lys Lys Asn Ser Pro
210 215 220

Gln Glu Glu Val Glu Leu Lys Lys Leu Lys His Leu Glu Lys Ser Val
225 230 235 240

Glu Lys Ile Ala Asp Gln Leu Glu Glu Leu Asn Lys Glu Leu Thr Gly
245 250 255

Ile Gln Gln Gly Phe Leu Pro Lys Asp Leu Gln Ala Glu Ala Leu Cys
260 265 270

Lys Leu Asp Arg Arg Val Lys Ala Thr Ile Glu Gln Phe Met Lys Ile
275 280 285

Leu Glu Glu Ile Asp Thr Leu Ile Leu Pro Glu Asn Phe Lys Asp Ser
290 295 300

Arg Leu Lys Arg Lys Gly Leu Val Lys Lys Val Gln Ala Phe Leu Ala
305 310 315 320

Glu Cys Asp Thr Val Glu Gln Asn Ile Cys Gln Glu Thr Glu Arg Leu
325 330 335

Gln Ser Thr Asn Phe Ala Leu Ala Glu
340 345
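
As a quick consistency check on a listing of this kind, the CDS range given under <222> should span a whole number of codons equal to the <211> length of the translated protein. The following is a minimal Python sketch under stated assumptions: the coordinates are taken to be 1-based and inclusive, the stop codon is taken to lie outside the range, and the helper name `cds_codon_count` is hypothetical rather than part of any sequence-listing tool.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of complete codons in a 1-based, inclusive CDS range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3


# SEQ ID NO: 1 declares CDS (46)..(1080); SEQ ID NO: 2 declares 345 residues.
print(cds_codon_count(46, 1080))
```

The same check can be applied to the other CDS ranges in the listing, for example (160)..(792) against the 211 residues of SEQ ID NO: 4.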



<210> 3
<211> 1179
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (160)..(792)

<400> 3

gcagccgcgg tgtcgcgaag tcctcccggg ttgcccccgc ggcgtcagag ggagggcggg 60
cgccgcgttg gtgacggcga ccctgcagcc caaggagcgc tccactcgct gccgccggag 120

ggccggtgac ctcttggcta ccccgcgtcg gaggcttag atg gct cag gcg aag 174
Met Ala Gln Ala Lys
1 5

atc aac gct aaa gcc aac gag ggg cgc ttc tgc cgc tcc tcc tcc atg 222
Ile Asn Ala Lys Ala Asn Glu Gly Arg Phe Cys Arg Ser Ser Ser Met
10 15 20

gct gac cgc tcc agc cgc ctg ctg gag agc ctg gac cag ctg gag ctc 270
Ala Asp Arg Ser Ser Arg Leu Leu Glu Ser Leu Asp Gln Leu Glu Leu
25 30 35

agg gtt gaa gct ttg aga gaa gca gca act gct gtt gag caa gag aaa 318
Arg Val Glu Ala Leu Arg Glu Ala Ala Thr Ala Val Glu Gln Glu Lys
40 45 50

gaa atc ctt ctg gaa atg atc cac agt atc caa aat agc cag gac atg 366
Glu Ile Leu Leu Glu Met Ile His Ser Ile Gln Asn Ser Gln Asp Met
55 60 65

agg cag atc agt gac gga gaa aga gaa gaa tta aat ctg act gca aac 414
Arg Gln Ile Ser Asp Gly Glu Arg Glu Glu Leu Asn Leu Thr Ala Asn
70 75 80 85

cgt ttg atg gga aga act ctc acc gtt gaa gtg tca gta gaa aca att 462
Arg Leu Met Gly Arg Thr Leu Thr Val Glu Val Ser Val Glu Thr Ile
90 95 100

aga aac ccc cag cag caa gaa tcc cta aag cat gcc aca agg att att 510
Arg Asn Pro Gln Gln Gln Glu Ser Leu Lys His Ala Thr Arg Ile Ile
105 110 115

gat gag gtg gtc aat aag ttt ctg gat gat ttg gga aat gcc aag agt 558
Asp Glu Val Val Asn Lys Phe Leu Asp Asp Leu Gly Asn Ala Lys Ser
120 125 130

cat tta atg tcg ctc tac agt gca tgt tca tct gag gtg cca cat ggg 606
His Leu Met Ser Leu Tyr Ser Ala Cys Ser Ser Glu Val Pro His Gly
135 140 145

cca gtt gat cag aag ttt caa tcc ata gta att ggc tgt gct ctt gaa 654
Pro Val Asp Gln Lys Phe Gln Ser Ile Val Ile Gly Cys Ala Leu Glu
150 155 160 165

gat cag aag aaa att aag aga aga tta gag act ctg ctt aga aat att 702
Asp Gln Lys Lys Ile Lys Arg Arg Leu Glu Thr Leu Leu Arg Asn Ile
170 175 180

gaa aac tct gac aag gcc atc aag cta tta gag cat tct aaa gga gct 750
Glu Asn Ser Asp Lys Ala Ile Lys Leu Leu Glu His Ser Lys Gly Ala
185 190 195

ggt tcc aaa act ctg caa caa aat gct gaa agc aga ttc aat 792
Gly Ser Lys Thr Leu Gln Gln Asn Ala Glu Ser Arg Phe Asn
200 205 210

tagtcttcaa acctaagagc atttacacaa tacacaaggt gtaaaaatga taaaatacta 852
ttttaattga taactagttc tttgttaggt ataaccactt agttgacact gatagttgtt 912
tcagatgagg aaaatattcc atcaagtatc ttcagttttg tgaataacaa aactagcaat 972
attttaatta tctatctaga gattttttag attgaattct tgtcttgtac taggatctag 1032
catatttcac tattctgtgg atgaatacat agtttgtggg gaaaacaaac gttcagctag 1092
gggcaaaaag catgactgct ttttcctgtc tggcatggaa tcacgcagtc accttgggca 1152
tttagtttac tagaaattct ttactgg 1179
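
The pairing of codons with residues in a CDS listing such as the one above can be reproduced with the standard genetic code. The sketch below is illustrative only and is not a method described in this document: it assumes the standard code applies, builds the codon table from the conventional 64-letter amino acid string in t, c, a, g order, and maps any codon containing an ambiguous base such as `n` to `X`. The function name `translate` is hypothetical. The example input is the first five codons of the SEQ ID NO: 3 coding region, which encode Met Ala Gln Ala Lys.

```python
from itertools import product

# Standard genetic code, codons ordered t, c, a, g at each position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence (spaces allowed) into one-letter residues.

    Codons not in the table (for example, those containing 'n') become 'X'.
    """
    seq = "".join(cds.lower().split())
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


print(translate("atg gct cag gcg aag"))
```

A translation that disagrees with the residue printed under a codon is a useful flag for a transcription error in either line.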
<210> 4
<211> 211
<212> PRT
<213> Homo sapiens

<400> 4

Met Ala Gln Ala Lys Ile Asn Ala Lys Ala Asn Glu Gly Arg Phe Cys
1 5 10 15

Arg Ser Ser Ser Met Ala Asp Arg Ser Ser Arg Leu Leu Glu Ser Leu
20 25 30

Asp Gln Leu Glu Leu Arg Val Glu Ala Leu Arg Glu Ala Ala Thr Ala
35 40 45

Val Glu Gln Glu Lys Glu Ile Leu Leu Glu Met Ile His Ser Ile Gln
50 55 60

Asn Ser Gln Asp Met Arg Gln Ile Ser Asp Gly Glu Arg Glu Glu Leu
65 70 75 80

Asn Leu Thr Ala Asn Arg Leu Met Gly Arg Thr Leu Thr Val Glu Val
85 90 95

Ser Val Glu Thr Ile Arg Asn Pro Gln Gln Gln Glu Ser Leu Lys His
100 105 110

Ala Thr Arg Ile Ile Asp Glu Val Val Asn Lys Phe Leu Asp Asp Leu
115 120 125

Gly Asn Ala Lys Ser His Leu Met Ser Leu Tyr Ser Ala Cys Ser Ser
130 135 140

Glu Val Pro His Gly Pro Val Asp Gln Lys Phe Gln Ser Ile Val Ile
145 150 155 160

Gly Cys Ala Leu Glu Asp Gln Lys Lys Ile Lys Arg Arg Leu Glu Thr
165 170 175

Leu Leu Arg Asn Ile Glu Asn Ser Asp Lys Ala Ile Lys Leu Leu Glu
180 185 190

His Ser Lys Gly Ala Gly Ser Lys Thr Leu Gln Gln Asn Ala Glu Ser
195 200 205

Arg Phe Asn
210
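
The sequence listing gives proteins in three-letter code, whereas the figure panels use one-letter code, so comparing the two requires a conversion. A minimal Python sketch follows, assuming the standard twenty residues plus `Xaa` for an unspecified residue; the names `THREE_TO_ONE` and `to_one_letter` are hypothetical.

```python
# Standard three-letter to one-letter amino acid codes, plus Xaa (unspecified).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(residues: str) -> str:
    """Convert whitespace-separated three-letter residues to a one-letter string.

    An unrecognised token raises KeyError, which makes transcription errors
    in a listing easy to spot.
    """
    return "".join(THREE_TO_ONE[token] for token in residues.split())


# First ten residues of SEQ ID NO: 4.
print(to_one_letter("Met Ala Gln Ala Lys Ile Asn Ala Lys Ala"))
```

Because unknown tokens raise an error instead of being skipped, running a whole listing through this function doubles as a spelling check on the residue names.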
<210> 5
<211> 2528
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (1)..(2031)

<400> 5

gcg gag ctc cgc atc caa ccc cgg gcc gcg gcc aac ttc tct gga ctg 48
Ala Glu Leu Arg Ile Gln Pro Arg Ala Ala Ala Asn Phe Ser Gly Leu
1 5 10 15

gac cag aag ttt cta gcc ggc cag ttg cta cct ccc ttt atc tcc tcc 96
Asp Gln Lys Phe Leu Ala Gly Gln Leu Leu Pro Pro Phe Ile Ser Ser
20 25 30

ttc ccc tct ggc agc gag gag gct att tcc aga cac ttc cac ccc tct 144
Phe Pro Ser Gly Ser Glu Glu Ala Ile Ser Arg His Phe His Pro Ser
35 40 45

ctg gcc acg tca ccc ccg cct tta att cat aaa ggt gcc cgg cgc cgg 192
Leu Ala Thr Ser Pro Pro Pro Leu Ile His Lys Gly Ala Arg Arg Arg
50 55 60

ctt ccc gga cac gtc ggc ggc gga gag ggg ccc acg gcg gcg gcc cgg 240
Leu Pro Gly His Val Gly Gly Gly Glu Gly Pro Thr Ala Ala Ala Arg
65 70 75 80

cca gag act cgg cgc ccg gag cca gcg ccc cgc acc cgc gcc cca gcg 288
Pro Glu Thr Arg Arg Pro Glu Pro Ala Pro Arg Thr Arg Ala Pro Ala
85 90 95

ggc aga ccc caa ccc agc atg agc gcc gcc acc cac tcg ccc atg atg 336
Gly Arg Pro Gln Pro Ser Met Ser Ala Ala Thr His Ser Pro Met Met
100 105 110

cag gtg gcg tcc ggc aac ggt gac cgc gac cct ttg ccc ccc gga tgg 384
Gln Val Ala Ser Gly Asn Gly Asp Arg Asp Pro Leu Pro Pro Gly Trp
115 120 125

gag atc aag atc gac ccg cag acc ggc tgg ccc ttc ttc gtg gac cac 432
Glu Ile Lys Ile Asp Pro Gln Thr Gly Trp Pro Phe Phe Val Asp His
130 135 140

aac agc cgc acc act acg tgg aac gac ccg cgc gtg ccc tct gag ggc 480
Asn Ser Arg Thr Thr Thr Trp Asn Asp Pro Arg Val Pro Ser Glu Gly
145 150 155 160

ccc aag gag act cca tcc tct gcc aat ggc cct tcc cgg gag ggc tct 528
Pro Lys Glu Thr Pro Ser Ser Ala Asn Gly Pro Ser Arg Glu Gly Ser
165 170 175

agg ctg ccg cct gct agg gaa ggc cac cct gtg tac ccc cag ctc cga 576
Arg Leu Pro Pro Ala Arg Glu Gly His Pro Val Tyr Pro Gln Leu Arg
180 185 190

cca ggc tac att ccc att cct gtg ctc cat gaa ggc gct gag aac cgg 624
Pro Gly Tyr Ile Pro Ile Pro Val Leu His Glu Gly Ala Glu Asn Arg
195 200 205

cag gtg cac cct ttc cat gtc tat ccc cag cct ggg atg cag cga ttc 672
Gln Val His Pro Phe His Val Tyr Pro Gln Pro Gly Met Gln Arg Phe
210 215 220

cga act gag gcg gca gca gcg gct cct cag agg tcc cag tca cct ctg 720
Arg Thr Glu Ala Ala Ala Ala Ala Pro Gln Arg Ser Gln Ser Pro Leu
225 230 235 240

cgg ggc atg cca gaa acc act cag cca gat aaa cag tgt gga cag gtg 768
Arg Gly Met Pro Glu Thr Thr Gln Pro Asp Lys Gln Cys Gly Gln Val
245 250 255

gca gcg gcg gcg gca gcc cag ccc cca gcc tcc cac gga cct gag cgg 816
Ala Ala Ala Ala Ala Ala Gln Pro Pro Ala Ser His Gly Pro Glu Arg
260 265 270

tcc cag tct cca gct gcc tct gac tgc tca tcc tca tcc tcc tcg gcc 864
Ser Gln Ser Pro Ala Ala Ser Asp Cys Ser Ser Ser Ser Ser Ser Ala
275 280 285

agc ctg cct tcc tcc ggc agg agc agc ctg ggc agt cac cag ctc ccg 912
Ser Leu Pro Ser Ser Gly Arg Ser Ser Leu Gly Ser His Gln Leu Pro
290 295 300

cgg ggg tac atc tcc att ccg gtg ata cac gag cag aac gtt acc cgg 960
Arg Gly Tyr Ile Ser Ile Pro Val Ile His Glu Gln Asn Val Thr Arg
305 310 315 320

cca gca gcc cag ccc tcc ttc cac aaa gcc cag aag acg cac tac cca 1008
Pro Ala Ala Gln Pro Ser Phe His Lys Ala Gln Lys Thr His Tyr Pro
325 330 335

gcg cag agg ggt gag tac cag acc cac cag cct gtg tac cac aag atc 1056
Ala Gln Arg Gly Glu Tyr Gln Thr His Gln Pro Val Tyr His Lys Ile
340 345 350

cag ggg gat gac tgg gag ccc cgg ccc ctg cgg gcg gca tcc ccg ttc 1104
Gln Gly Asp Asp Trp Glu Pro Arg Pro Leu Arg Ala Ala Ser Pro Phe
355 360 365

agg tca tct gtc cag ggt gca tcg agc cgg gag ggc tca cca gcc agg 1152
Arg Ser Ser Val Gln Gly Ala Ser Ser Arg Glu Gly Ser Pro Ala Arg
370 375 380

agc agc acg cca ctc cac tcc ccc tcg ccc atc cgt gtg cac acc gtg 1200
Ser Ser Thr Pro Leu His Ser Pro Ser Pro Ile Arg Val His Thr Val
385 390 395 400

gtc gac agg cct cag cag ccc atg acc cat cga gaa act gca cct gtt 1248
Val Asp Arg Pro Gln Gln Pro Met Thr His Arg Glu Thr Ala Pro Val
405 410 415

tcc cag cct gaa aac aaa cca gaa agt aag cca ggc cca gtt gga cca 1296
Ser Gln Pro Glu Asn Lys Pro Glu Ser Lys Pro Gly Pro Val Gly Pro
420 425 430

gaa ctc cct cct gga cac atc cca att caa gtg atc cgc aaa gag gtg 1344
Glu Leu Pro Pro Gly His Ile Pro Ile Gln Val Ile Arg Lys Glu Val
435 440 445

gat tct aaa cct gtt tcc cag aag ccc cca cct ccc tct gag aag gta 1392
Asp Ser Lys Pro Val Ser Gln Lys Pro Pro Pro Pro Ser Glu Lys Val
450 455 460

gag gtg aaa gtt ccc cct gct cca gtt cct tgt cct cct ccc agc cct 1440
Glu Val Lys Val Pro Pro Ala Pro Val Pro Cys Pro Pro Pro Ser Pro
465 470 475 480

ggc cct tct gct gtc ccc tct tcc ccc aag agt gtg gct aca gaa gag 1488
Gly Pro Ser Ala Val Pro Ser Ser Pro Lys Ser Val Ala Thr Glu Glu
485 490 495

agg gca gcc ccc agc act gcc cct gca gaa gct aca cct cca aaa cca 1536
Arg Ala Ala Pro Ser Thr Ala Pro Ala Glu Ala Thr Pro Pro Lys Pro
500 505 510

gga gaa gcc gag gct ccc cca aaa cat cca gga gtg ctg aaa gtg gaa 1584
Gly Glu Ala Glu Ala Pro Pro Lys His Pro Gly Val Leu Lys Val Glu
515 520 525

gcc atc ctg gag aag gtg cag ggg ctg gag cag gct gta gac aac ttt 1632
Ala Ile Leu Glu Lys Val Gln Gly Leu Glu Gln Ala Val Asp Asn Phe
530 535 540

gaa ggc aag aag act gac aaa aag tac ctg atg atc gaa gag tat ttg 1680
Glu Gly Lys Lys Thr Asp Lys Lys Tyr Leu Met Ile Glu Glu Tyr Leu
545 550 555 560

acc aaa gag ctg ctg gcc ctg gat tca gtg gac ccc gag gga cga gcc 1728
Thr Lys Glu Leu Leu Ala Leu Asp Ser Val Asp Pro Glu Gly Arg Ala
565 570 575

gat gtg cgt cag gcc agg aga gac ggt gtc agg aag gtt cag acc atc 1776
Asp Val Arg Gln Ala Arg Arg Asp Gly Val Arg Lys Val Gln Thr Ile
580 585 590

ttg gaa aaa ctt gaa cag aaa gcc att gat gtc cca ggt caa gtc cag 1824
Leu Glu Lys Leu Glu Gln Lys Ala Ile Asp Val Pro Gly Gln Val Gln
595 600 605

gtc tat gaa ctc cag ccc agc aac ctt gaa gca gat cag cca ctg cag 1872
Val Tyr Glu Leu Gln Pro Ser Asn Leu Glu Ala Asp Gln Pro Leu Gln
610 615 620

gca atc atg gag atg ggt gcc gtg gca gca gac aag ggc aag aaa aat 1920
Ala Ile Met Glu Met Gly Ala Val Ala Ala Asp Lys Gly Lys Lys Asn
625 630 635 640

gct gga aat gca gaa gat ccc cac aca gaa acc cag cag cca gaa gcc 1968
Ala Gly Asn Ala Glu Asp Pro His Thr Glu Thr Gln Gln Pro Glu Ala
645 650 655

aca gca gca gcg act tca aac ccc agc agc atg aca gac acc cct ggt 2016
Thr Ala Ala Ala Thr Ser Asn Pro Ser Ser Met Thr Asp Thr Pro Gly
660 665 670

aac cca gca gca ccg tagcctctgc cctgtaaaag tcagactcgg aaccgatgtg 2071
Asn Pro Ala Ala Pro
675

tgctttaggg attttagttg catgcatttc agagacttta ggtcagttgg ttttgattag 2131
ctgcttggta tgcagtactt gggtgaggca aacactataa agggctaaaa gggaaaatga 2191
tgcttttctt caatattctt actcttgtac aattaangaa gttgcttgtt gtttgagaag 2251
tttaaccccg ttgcttgttc tgcagccctg tcnacttggg cacccccacc acctgttagc 2311
tgtggttgtg cactgtcttt tgtagctctg gactggaggg gtagatgggg agtcaattac 2371
ccatcacata aatatgaaac atttatcaga aatgttgcca ttttaatgag atgattttct 2431
tcatctcata atcaaaatac ctgactttag agagagtaaa atgtgccagg agccatagga 2491
atatctgtat gttggatgac tttaatgcta catttth 2528



<210> 6
<211> 677
<212> PRT
<213> Homo sapiens

<400> 6

Ala Glu Leu Arg Ile Gln Pro Arg Ala Ala Ala Asn Phe Ser Gly Leu
1 5 10 15

Asp Gln Lys Phe Leu Ala Gly Gln Leu Leu Pro Pro Phe Ile Ser Ser
20 25 30

Phe Pro Ser Gly Ser Glu Glu Ala Ile Ser Arg His Phe His Pro Ser
35 40 45

Leu Ala Thr Ser Pro Pro Pro Leu Ile His Lys Gly Ala Arg Arg Arg
50 55 60

Leu Pro Gly His Val Gly Gly Gly Glu Gly Pro Thr Ala Ala Ala Arg
65 70 75 80

Pro Glu Thr Arg Arg Pro Glu Pro Ala Pro Arg Thr Arg Ala Pro Ala
85 90 95

Gly Arg Pro Gln Pro Ser Met Ser Ala Ala Thr His Ser Pro Met Met
100 105 110

Gln Val Ala Ser Gly Asn Gly Asp Arg Asp Pro Leu Pro Pro Gly Trp
115 120 125

Glu Ile Lys Ile Asp Pro Gln Thr Gly Trp Pro Phe Phe Val Asp His
130 135 140

Asn Ser Arg Thr Thr Thr Trp Asn Asp Pro Arg Val Pro Ser Glu Gly
145 150 155 160

Pro Lys Glu Thr Pro Ser Ser Ala Asn Gly Pro Ser Arg Glu Gly Ser
165 170 175

Arg Leu Pro Pro Ala Arg Glu Gly His Pro Val Tyr Pro Gln Leu Arg
180 185 190

Pro Gly Tyr Ile Pro Ile Pro Val Leu His Glu Gly Ala Glu Asn Arg
195 200 205

Gln Val His Pro Phe His Val Tyr Pro Gln Pro Gly Met Gln Arg Phe
210 215 220

Arg Thr Glu Ala Ala Ala Ala Ala Pro Gln Arg Ser Gln Ser Pro Leu
225 230 235 240

Arg Gly Met Pro Glu Thr Thr Gln Pro Asp Lys Gln Cys Gly Gln Val
245 250 255

Ala Ala Ala Ala Ala Ala Gln Pro Pro Ala Ser His Gly Pro Glu Arg
260 265 270

Ser Gln Ser Pro Ala Ala Ser Asp Cys Ser Ser Ser Ser Ser Ser Ala
275 280 285

Ser Leu Pro Ser Ser Gly Arg Ser Ser Leu Gly Ser His Gln Leu Pro
290 295 300

Arg Gly Tyr Ile Ser Ile Pro Val Ile His Glu Gln Asn Val Thr Arg
305 310 315 320

Pro Ala Ala Gln Pro Ser Phe His Lys Ala Gln Lys Thr His Tyr Pro
325 330 335

Ala Gln Arg Gly Glu Tyr Gln Thr His Gln Pro Val Tyr His Lys Ile
340 345 350

Gln Gly Asp Asp Trp Glu Pro Arg Pro Leu Arg Ala Ala Ser Pro Phe
355 360 365

Arg Ser Ser Val Gln Gly Ala Ser Ser Arg Glu Gly Ser Pro Ala Arg
370 375 380

Ser Ser Thr Pro Leu His Ser Pro Ser Pro Ile Arg Val His Thr Val
385 390 395 400

Val Asp Arg Pro Gln Gln Pro Met Thr His Arg Glu Thr Ala Pro Val
405 410 415

Ser Gln Pro Glu Asn Lys Pro Glu Ser Lys Pro Gly Pro Val Gly Pro
420 425 430

Glu Leu Pro Pro Gly His Ile Pro Ile Gln Val Ile Arg Lys Glu Val
435 440 445

Asp Ser Lys Pro Val Ser Gln Lys Pro Pro Pro Pro Ser Glu Lys Val
450 455 460

Glu Val Lys Val Pro Pro Ala Pro Val Pro Cys Pro Pro Pro Ser Pro
465 470 475 480

Gly Pro Ser Ala Val Pro Ser Ser Pro Lys Ser Val Ala Thr Glu Glu
485 490 495

Arg Ala Ala Pro Ser Thr Ala Pro Ala Glu Ala Thr Pro Pro Lys Pro
500 505 510

Gly Glu Ala Glu Ala Pro Pro Lys His Pro Gly Val Leu Lys Val Glu
515 520 525

Ala Ile Leu Glu Lys Val Gln Gly Leu Glu Gln Ala Val Asp Asn Phe
530 535 540

Glu Gly Lys Lys Thr Asp Lys Lys Tyr Leu Met Ile Glu Glu Tyr Leu
545 550 555 560

Thr Lys Glu Leu Leu Ala Leu Asp Ser Val Asp Pro Glu Gly Arg Ala
565 570 575

Asp Val Arg Gln Ala Arg Arg Asp Gly Val Arg Lys Val Gln Thr Ile
580 585 590

Leu Glu Lys Leu Glu Gln Lys Ala Ile Asp Val Pro Gly Gln Val Gln
595 600 605

Val Tyr Glu Leu Gln Pro Ser Asn Leu Glu Ala Asp Gln Pro Leu Gln
610 615 620

Ala Ile Met Glu Met Gly Ala Val Ala Ala Asp Lys Gly Lys Lys Asn
625 630 635 640

Ala Gly Asn Ala Glu Asp Pro His Thr Glu Thr Gln Gln Pro Glu Ala
645 650 655

Thr Ala Ala Ala Thr Ser Asn Pro Ser Ser Met Thr Asp Thr Pro Gly
660 665 670

Asn Pro Ala Ala Pro
675
<210> 7
<211> 1010
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (323)..(1009)

<400> 7

acgatatcct gtaagaccaa gaattgcaag gccagagttt gaattcttat acaaatggag 60
cgtatggtcc aacatacccc ccaggccctg gggcaaatac tgcctcatac tcaggggctt 120
attargcacc tggttatact cagaccagtt actccacaga agttccaagt acttaccgtt 180
catc':ggcaa cagcccaact ccagtctctc gttggatcta tccccagcag gactgtcaag 240
actgaagcac cccctcttaa ggggcaggtt ccaggatatc cgccttcaca gaaccctgga 300

atgaccctgc cccattatcc tt atg gag atg gta atc gta gtg ttc cac aat 352
Met Glu Met Val Ile Val Val Phe His Asn
1 5 10

cac ggc cga ctg tac gac cac aag aaa gat gcg tgg gct tct cct ggt 400
His Gly Arg Leu Tyr Asp His Lys Lys Asp Ala Trp Ala Ser Pro Gly
15 20 25

gct tat gga atg ggt ggc cgt tat ccc tgg cct tca tca gcg ccc tca 448
Ala Tyr Gly Met Gly Gly Arg Tyr Pro Trp Pro Ser Ser Ala Pro Ser
30 35 40

gca cca ccc ggc aat ctc tac atg act gaa agt act tca cca tgg cct 496
Ala Pro Pro Gly Asn Leu Tyr Met Thr Glu Ser Thr Ser Pro Trp Pro
45 50 55

agc agt ggc tct ccc cag tca ccc cct tca ccc cca gtc cag cag ccc 544
Ser Ser Gly Ser Pro Gln Ser Pro Pro Ser Pro Pro Val Gln Gln Pro
60 65 70

aag gat tct tca tac ccc tat agc caa tca gat caa agc atg aac cgg 592
Lys Asp Ser Ser Tyr Pro Tyr Ser Gln Ser Asp Gln Ser Met Asn Arg
75 80 85 90

cac aac ttt cct tgc agt gtc cat cag tac gaa tcc tcg ggg aca gtg 640
His Asn Phe Pro Cys Ser Val His Gln Tyr Glu Ser Ser Gly Thr Val
95 100 105

aac aat gat gat tca gat ctt ttg gat tcc caa gtc cag tat agt gct 688
Asn Asn Asp Asp Ser Asp Leu Leu Asp Ser Gln Val Gln Tyr Ser Ala
110 115 120

gag cct cag ctg tat ggt aat gcc acc agt gac cat ccc aac aat caa 736
Glu Pro Gln Leu Tyr Gly Asn Ala Thr Ser Asp His Pro Asn Asn Gln
125 130 135

gat caa agt agt agt ctt cct gaa gaa tgt gta cct tca gat gaa agt 784
Asp Gln Ser Ser Ser Leu Pro Glu Glu Cys Val Pro Ser Asp Glu Ser
140 145 150

act cct ccg agt att aaa aaa atc ata cat gtg ctg gag aag gtc cag 832
Thr Pro Pro Ser Ile Lys Lys Ile Ile His Val Leu Glu Lys Val Gln
155 160 165 170

tat ctt gaa caa gaa gta gaa gaa ttt gta gga aaa aag aca gac aaa 880
Tyr Leu Glu Gln Glu Val Glu Glu Phe Val Gly Lys Lys Thr Asp Lys
175 180 185

gca tac tgg ctt ctg gaa gaa atg cta acc aag gaa ctt ttg gaa ctg 928
Ala Tyr Trp Leu Leu Glu Glu Met Leu Thr Lys Glu Leu Leu Glu Leu
190 195 200

gat tca gtt gaa act ggg ggc cag gac tct gta cgg cag gcc aga aaa 976
Asp Ser Val Glu Thr Gly Gly Gln Asp Ser Val Arg Gln Ala Arg Lys
205 210 215

gag gct gtt tgt aag att cag gcc ata ttg gaa a 1010
Glu Ala Val Cys Lys Ile Gln Ala Ile Leu Glu
220 225



<210> 8
<211> 229
<212> PRT
<213> Homo sapiens

<400> 8

Met Glu Met Val Ile Val Val Phe His Asn His Gly Arg Leu Tyr Asp
1 5 10 15

His Lys Lys Asp Ala Trp Ala Ser Pro Gly Ala Tyr Gly Met Gly Gly
20 25 30

Arg Tyr Pro Trp Pro Ser Ser Ala Pro Ser Ala Pro Pro Gly Asn Leu
35 40 45

Tyr Met Thr Glu Ser Thr Ser Pro Trp Pro Ser Ser Gly Ser Pro Gln
50 55 60

Ser Pro Pro Ser Pro Pro Val Gln Gln Pro Lys Asp Ser Ser Tyr Pro
65 70 75 80

Tyr Ser Gln Ser Asp Gln Ser Met Asn Arg His Asn Phe Pro Cys Ser
85 90 95

Val His Gln Tyr Glu Ser Ser Gly Thr Val Asn Asn Asp Asp Ser Asp
100 105 110

Leu Leu Asp Ser Gln Val Gln Tyr Ser Ala Glu Pro Gln Leu Tyr Gly
115 120 125

Asn Ala Thr Ser Asp His Pro Asn Asn Gln Asp Gln Ser Ser Ser Leu
130 135 140

Pro Glu Glu Cys Val Pro Ser Asp Glu Ser Thr Pro Pro Ser Ile Lys
145 150 155 160

Lys Ile Ile His Val Leu Glu Lys Val Gln Tyr Leu Glu Gln Glu Val
165 170 175

Glu Glu Phe Val Gly Lys Lys Thr Asp Lys Ala Tyr Trp Leu Leu Glu
180 185 190

Glu Met Leu Thr Lys Glu Leu Leu Glu Leu Asp Ser Val Glu Thr Gly
195 200 205

Gly Gln Asp Ser Val Arg Gln Ala Arg Lys Glu Ala Val Cys Lys Ile
210 215 220

Gln Ala Ile Leu Glu
225



<210> 9
<211> 689
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (3)..(482)

<220>
<221> unsure
<222> (105)
<223> any amino acid

<400> 9

ga gaa ata aaa aat gaa ctt ctc caa gca caa aac cct tct gaa ttg 47
Glu Ile Lys Asn Glu Leu Leu Gln Ala Gln Asn Pro Ser Glu Leu
1 5 10 15

tac ctg agc tcc aaa aca gaa ttg cag ggt tta att gga cag ttg gat 95
Tyr Leu Ser Ser Lys Thr Glu Leu Gln Gly Leu Ile Gly Gln Leu Asp
20 25 30

gag gta agt ntt gaa aaa aac ccc tgc atc cgg gaa gcc agg aga aga 143
Glu Val Ser Xaa Glu Lys Asn Pro Cys Ile Arg Glu Ala Arg Arg Arg
35 40 45

gca gtg atc gag gtg caa act ctg atc aca tat att gac ttg aag gag 191
Ala Val Ile Glu Val Gln Thr Leu Ile Thr Tyr Ile Asp Leu Lys Glu
50 55 60

gcc ctt gag aaa aga aag ctg ttt gct tgt gag gag cac cca tcc cat 239
Ala Leu Glu Lys Arg Lys Leu Phe Ala Cys Glu Glu His Pro Ser His
65 70 75

aaa gcc gtc tgg aac gtc ctt gga aac ttg tct gag atc cag gga gaa 287
Lys Ala Val Trp Asn Val Leu Gly Asn Leu Ser Glu Ile Gln Gly Glu
80 85 90 95

gtt ctt tca ttt gat gga aat cga acc gat aag aac tac atc cgg ctg 335
Val Leu Ser Phe Asp Gly Asn Arg Thr Asp Lys Asn Tyr Ile Arg Leu
100 105 110

gaa gag ctg ctc acc aag cag ctg cta gcc ctg gat gct gtt gat ccg 383
Glu Glu Leu Leu Thr Lys Gln Leu Leu Ala Leu Asp Ala Val Asp Pro
115 120 125

cag gga gaa gag aag tgt aag gct gcc agg aaa caa gct gtg agg ctt 431
Gln Gly Glu Glu Lys Cys Lys Ala Ala Arg Lys Gln Ala Val Arg Leu
130 135 140

gcg cag aat att ctc agc tat ctc gac ctg aaa tct gat gaa tgg gag 479
Ala Gln Asn Ile Leu Ser Tyr Leu Asp Leu Lys Ser Asp Glu Trp Glu
145 150 155

tac tgaaatacca gagatctcac ttttgatact gttttgcact tcatatgtgc 532
Tyr
160

ttctatgtat agagagcttt cagttcattg atttatacgt gcatatrtca gtctcagtat 592
ttatgattga agcaaattc: attcagtatc tgctgctttt gatgttgcaa gacaaatatc 652
attacagcac gttaactttt ccattcggat caaaaaa 689



<210> 10
<211> 160
<212> PRT
<213> Homo sapiens

<400> 10

Glu Ile Lys Asn Glu Leu Leu Gln Ala Gln Asn Pro Ser Glu Leu Tyr
1 5 10 15

Leu Ser Ser Lys Thr Glu Leu Gln Gly Leu Ile Gly Gln Leu Asp Glu
20 25 30

Val Ser Xaa Glu Lys Asn Pro Cys Ile Arg Glu Ala Arg Arg Arg Ala
35 40 45

Val Ile Glu Val Gln Thr Leu Ile Thr Tyr Ile Asp Leu Lys Glu Ala
50 55 60

Leu Glu Lys Arg Lys Leu Phe Ala Cys Glu Glu His Pro Ser His Lys
65 70 75 80

Ala Val Trp Asn Val Leu Gly Asn Leu Ser Glu Ile Gln Gly Glu Val
85 90 95

Leu Ser Phe Asp Gly Asn Arg Thr Asp Lys Asn Tyr Ile Arg Leu Glu
100 105 110

Glu Leu Leu Thr Lys Gln Leu Leu Ala Leu Asp Ala Val Asp Pro Gln
115 120 125

Gly Glu Glu Lys Cys Lys Ala Ala Arg Lys Gln Ala Val Arg Leu Ala
130 135 140

Gln Asn Ile Leu Ser Tyr Leu Asp Leu Lys Ser Asp Glu Trp Glu Tyr
145 150 155 160



<210> 11 
<211> 246 
<212> DNA 

<213> Caenorhabditis elegans
<400> 11

atgcctttci' gcctcttcgt tgaaatattt cactttcttt tccagctttt tccccatctc 60

gacctgcttt ggtttttcga gaaaaccacg ttccaaatca gcgacatctc tcaaattgag 120 

atcataggct ttttgaagat tgctcaaatt a:gcttctca ta:tgcatga gcattttgaa 180 

gcccgcgtca tcaaccaaag cattttttcc acccatcaca atgattttat cattttcttt 240 

aaaaU 246 



<210> 12 
<211> 210 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 12 

Met Lys Val Asn Val Ser Cys Ser Ser Val Gln Thr Thr Ile Asp Ile
1 5 10 15

Leu Glu Glu Asn Gln Gly Glu Asp Glu Ser Ile Leu Thr Leu Gly Gln
20 25 30

Leu Arg Asp Arg Ile Ala Thr Asp Asn Asp Val Asp Val Glu Thr Met
35 40 45

Lys Leu Leu His Arg Gly Lys Phe Leu Gln Gly Ala Asp Asp Val Ser
50 55 60

Leu Ser Thr Leu Asn Phe Lys Glu Asn Asp Lys Ile Ile Val Met Gly
65 70 75 80

Gly Lys Asn Ala Leu Val Asp Asp Ala Gly Phe Lys Met Leu Met Gln
85 90 95

Tyr Glu Lys His Asn Leu Ser Asn Leu Gln Lys Ala Tyr Asp Leu Asn
100 105 110

Leu Arg Asp Val Ala Asp Leu Glu Arg Gly Phe Leu Glu Lys Pro Lys
115 120 125

Gln Val Glu Met Gly Lys Lys Leu Glu Lys Lys Val Lys Tyr Phe Asn
130 135 140

Glu Glu Ala Glu Arg His Leu Glu Thr Leu Asp Gly Met Asn Ile Ile
145 150 155 160

Thr Glu Thr Thr Pro Glu Asn Gln Ala Lys Arg Asn Arg Glu Lys Arg
165 170 175

Lys Thr Leu Val Asn Gly Ile Gln Thr Leu Leu Asn Gln Asn Asp Ala
180 185 190

Leu Leu Arg Arg Leu Gln Glu Tyr Gln Ser Val Leu Asn Gly Asp Ile
195 200 205

Pro Glu
210



<210> 13 
<211> 1377 
<212> DNA

<213> Caenorhabditis elegans

<220>

<221> CDS

<222> (1)..(1377)

<400> 13 

atg cca gtc gtg aac ata cca atc aaa ata ctt ggt cag aat caa tca 48
Met Pro Val Val Asn Ile Pro Ile Lys Ile Leu Gly Gln Asn Gln Ser
1 5 10 15

cat agt cga agt aac tcc tcg tct tct gtt gac aac gat cga aat caa 96
His Ser Arg Ser Asn Ser Ser Ser Ser Val Asp Asn Asp Arg Asn Gln
20 25 30

cca cca cag cag cca cct caa ccg caa cca caa cag caa tct cag caa 144
Pro Pro Gln Gln Pro Pro Gln Pro Gln Pro Gln Gln Gln Ser Gln Gln
35 40 45

caa tac cag cag gct cca aac gtg aat acc aat atg cat cat tcc aac 192
Gln Tyr Gln Gln Ala Pro Asn Val Asn Thr Asn Met His His Ser Asn
50 55 60

gga ttc tca cct aac ttc cca tct cgt agt cct att ccg gac ttt ccc 240
Gly Phe Ser Pro Asn Phe Pro Ser Arg Ser Pro Ile Pro Asp Phe Pro
65 70 75 80

agt ttt tca tct ggg ttc cca aac gat tct gaa tgg tct tcg aat ttc 288
Ser Phe Ser Ser Gly Phe Pro Asn Asp Ser Glu Trp Ser Ser Asn Phe
85 90 95




ccg tcg ttt cca aat ttc cca agt gga ttc tca aat gga agt tct aat 336
Pro Ser Phe Pro Asn Phe Pro Ser Gly Phe Ser Asn Gly Ser Ser Asn
100 105 110

ttc cct gat ttt cca aga ttc gga aga gat gga gga cta tcg cca aac 384
Phe Pro Asp Phe Pro Arg Phe Gly Arg Asp Gly Gly Leu Ser Pro Asn
115 120 125

cca ccg atg caa gga tac agg aga agt cca aca cca aca tca act caa 432
Pro Pro Met Gln Gly Tyr Arg Arg Ser Pro Thr Pro Thr Ser Thr Gln
130 135 140

tct cca act tct aca tta aga cgc aac tct cag cag aat caa gct cct 480
Ser Pro Thr Ser Thr Leu Arg Arg Asn Ser Gln Gln Asn Gln Ala Pro
145 150 155 160

cca caa tat tct cag caa caa cca caa caa gct caa caa cgt cag aca 528
Pro Gln Tyr Ser Gln Gln Gln Pro Gln Gln Ala Gln Gln Arg Gln Thr
165 170 175

act cct ccg tca aca aaa gct tca tct cga cca cca tct cgt act cgt 576
Thr Pro Pro Ser Thr Lys Ala Ser Ser Arg Pro Pro Ser Arg Thr Arg
180 185 190

gaa cca aag gaa cct gag gta ccc gag aga cca gca gtt att cca ttg 624
Glu Pro Lys Glu Pro Glu Val Pro Glu Arg Pro Ala Val Ile Pro Leu
195 200 205

cca tat gag aag aag gag aaa cca ctg gag aag aaa ggt agt cgt gat 672
Pro Tyr Glu Lys Lys Glu Lys Pro Leu Glu Lys Lys Gly Ser Arg Asp
210 215 220

tct gga aag ggt gat gag aac ctt gaa gag aac att gcc aag atc acg 720
Ser Gly Lys Gly Asp Glu Asn Leu Glu Glu Asn Ile Ala Lys Ile Thr
225 230 235 240

atc gga aag aat aat tgc gag tta tgt ccg gaa caa gaa acg gac ggc 768
Ile Gly Lys Asn Asn Cys Glu Leu Cys Pro Glu Gln Glu Thr Asp Gly
245 250 255

gac cca tct cca cta acc tcc cca atc acc gaa gga aag cca aag aga 816
Asp Pro Ser Pro Leu Thr Ser Pro Ile Thr Glu Gly Lys Pro Lys Arg
260 265 270

gga aag aaa ctt caa cgt aat caa agt gtt gtt gat ttc aat gcc aag 864
Gly Lys Lys Leu Gln Arg Asn Gln Ser Val Val Asp Phe Asn Ala Lys
275 280 285

aca att gtt act ttg gat aaa att gaa tta caa gtt gag cag ttg aga 912
Thr Ile Val Thr Leu Asp Lys Ile Glu Leu Gln Val Glu Gln Leu Arg
290 295 300

aaa aaa gct gct gaa ctc gaa atg gaa aaa gag caa att ctt cgt tct 960
Lys Lys Ala Ala Glu Leu Glu Met Glu Lys Glu Gln Ile Leu Arg Ser
305 310 315 320

cta gga gaa atc agt gtt cat aac tgc atg ttc aaa ctg gaa gaa tgt 1008
Leu Gly Glu Ile Ser Val His Asn Cys Met Phe Lys Leu Glu Glu Cys
325 330 335

gat cgt gaa gag att gaa gca atc act gac cga ttg aca aaa aga aca 1056
Asp Arg Glu Glu Ile Glu Ala Ile Thr Asp Arg Leu Thr Lys Arg Thr
340 345 350

aag aca gtt caa gtt gtt gtc gaa act cca cga aat gaa gaa cag aaa 1104
Lys Thr Val Gln Val Val Val Glu Thr Pro Arg Asn Glu Glu Gln Lys
355 360 365

aaa gca ctg gaa gat gca act ttg atg atc gat gaa gtc gga gaa atg 1152
Lys Ala Leu Glu Asp Ala Thr Leu Met Ile Asp Glu Val Gly Glu Met
370 375 380

atg cat tcg aat att gaa aag gct aag ctg tgc cta caa acc tac atg 1200
Met His Ser Asn Ile Glu Lys Ala Lys Leu Cys Leu Gln Thr Tyr Met
385 390 395 400

aac gcc tgt tcg tac gaa gaa act gct gga gcc acc tgc caa aac ttc 1248
Asn Ala Cys Ser Tyr Glu Glu Thr Ala Gly Ala Thr Cys Gln Asn Phe
405 410 415

ttg aag atc ata att cag tgc gct gct gat gat cag aaa cgc atc aag 1296
Leu Lys Ile Ile Ile Gln Cys Ala Ala Asp Asp Gln Lys Arg Ile Lys
420 425 430

cgt cgt ctg gaa aat ctg atg tct caa att gag aat gct gag aga acg 1344
Arg Arg Leu Glu Asn Leu Met Ser Gln Ile Glu Asn Ala Glu Arg Thr
435 440 445

aaa gca gat ttg atg gat gat caa agc gaa tag 1377
Lys Ala Asp Leu Met Asp Asp Gln Ser Glu
450 455



<210> 14 
<211> 458 
<212> PRT 




<213> Caenorhabditis elegans

<400> 14

Met Pro Val Val Asn Ile Pro Ile Lys Ile Leu Gly Gln Asn Gln Ser
1 5 10 15

His Ser Arg Ser Asn Ser Ser Ser Ser Val Asp Asn Asp Arg Asn Gln
20 25 30

Pro Pro Gln Gln Pro Pro Gln Pro Gln Pro Gln Gln Gln Ser Gln Gln
35 40 45

Gln Tyr Gln Gln Ala Pro Asn Val Asn Thr Asn Met His His Ser Asn
50 55 60

Gly Phe Ser Pro Asn Phe Pro Ser Arg Ser Pro Ile Pro Asp Phe Pro
65 70 75 80

Ser Phe Ser Ser Gly Phe Pro Asn Asp Ser Glu Trp Ser Ser Asn Phe
85 90 95

Pro Ser Phe Pro Asn Phe Pro Ser Gly Phe Ser Asn Gly Ser Ser Asn
100 105 110

Phe Pro Asp Phe Pro Arg Phe Gly Arg Asp Gly Gly Leu Ser Pro Asn
115 120 125

Pro Pro Met Gln Gly Tyr Arg Arg Ser Pro Thr Pro Thr Ser Thr Gln
130 135 140

Ser Pro Thr Ser Thr Leu Arg Arg Asn Ser Gln Gln Asn Gln Ala Pro
145 150 155 160

Pro Gln Tyr Ser Gln Gln Gln Pro Gln Gln Ala Gln Gln Arg Gln Thr
165 170 175

Thr Pro Pro Ser Thr Lys Ala Ser Ser Arg Pro Pro Ser Arg Thr Arg
180 185 190

Glu Pro Lys Glu Pro Glu Val Pro Glu Arg Pro Ala Val Ile Pro Leu
195 200 205

Pro Tyr Glu Lys Lys Glu Lys Pro Leu Glu Lys Lys Gly Ser Arg Asp
210 215 220

Ser Gly Lys Gly Asp Glu Asn Leu Glu Glu Asn Ile Ala Lys Ile Thr
225 230 235 240

Ile Gly Lys Asn Asn Cys Glu Leu Cys Pro Glu Gln Glu Thr Asp Gly
245 250 255

Asp Pro Ser Pro Leu Thr Ser Pro Ile Thr Glu Gly Lys Pro Lys Arg
260 265 270

Gly Lys Lys Leu Gln Arg Asn Gln Ser Val Val Asp Phe Asn Ala Lys
275 280 285

Thr Ile Val Thr Leu Asp Lys Ile Glu Leu Gln Val Glu Gln Leu Arg
290 295 300

Lys Lys Ala Ala Glu Leu Glu Met Glu Lys Glu Gln Ile Leu Arg Ser
305 310 315 320

Leu Gly Glu Ile Ser Val His Asn Cys Met Phe Lys Leu Glu Glu Cys
325 330 335

Asp Arg Glu Glu Ile Glu Ala Ile Thr Asp Arg Leu Thr Lys Arg Thr
340 345 350

Lys Thr Val Gln Val Val Val Glu Thr Pro Arg Asn Glu Glu Gln Lys
355 360 365

Lys Ala Leu Glu Asp Ala Thr Leu Met Ile Asp Glu Val Gly Glu Met
370 375 380

Met His Ser Asn Ile Glu Lys Ala Lys Leu Cys Leu Gln Thr Tyr Met
385 390 395 400

Asn Ala Cys Ser Tyr Glu Glu Thr Ala Gly Ala Thr Cys Gln Asn Phe
405 410 415

Leu Lys Ile Ile Ile Gln Cys Ala Ala Asp Asp Gln Lys Arg Ile Lys
420 425 430

Arg Arg Leu Glu Asn Leu Met Ser Gln Ile Glu Asn Ala Glu Arg Thr
435 440 445

Lys Ala Asp Leu Met Asp Asp Gln Ser Glu
450 455



<210> 15 

<211> 588 

<212> DNA 

<213> Schizosaccharomyces pombe 






<220>

<221> CDS

<222> (1)..(588)

<400> 15

atg tca gaa aag act agc aca gtt aca ata cac tat gga aat cag cga 48
Met Ser Glu Lys Thr Ser Thr Val Thr Ile His Tyr Gly Asn Gln Arg
1 5 10 15

ttt ccg gta gca gtc aat cta aat gag acg tta agt gaa ctg atc gat 96
Phe Pro Val Ala Val Asn Leu Asn Glu Thr Leu Ser Glu Leu Ile Asp
20 25 30

gat tta ctt gaa acg act gag att tct gag aag aaa gtc aag ctt ttt 144
Asp Leu Leu Glu Thr Thr Glu Ile Ser Glu Lys Lys Val Lys Leu Phe
35 40 45

tac gct ggc aag cgt tta aaa gac aaa aaa gcc tcg tta tca aaa ttg 192
Tyr Ala Gly Lys Arg Leu Lys Asp Lys Lys Ala Ser Leu Ser Lys Leu
50 55 60

ggt tta aaa aat cat agt aaa att cta tgt ata aga cca cat aag caa 240
Gly Leu Lys Asn His Ser Lys Ile Leu Cys Ile Arg Pro His Lys Gln
65 70 75 80

caa cga ggt tcc aag gaa aaa gac acg gtt gag ccc gct ccg aaa gcg 288
Gln Arg Gly Ser Lys Glu Lys Asp Thr Val Glu Pro Ala Pro Lys Ala
85 90 95

gaa gcg gag aat cct gta ttt tcg cgt att tct gga gaa ata aaa gcc 336
Glu Ala Glu Asn Pro Val Phe Ser Arg Ile Ser Gly Glu Ile Lys Ala
100 105 110

atc gat cag tat gtt gac aaa gaa ctt tcc ccc atg tac gac aat tac 384
Ile Asp Gln Tyr Val Asp Lys Glu Leu Ser Pro Met Tyr Asp Asn Tyr
115 120 125

gta aat aaa ccg tcg aac gat cca aag cag aaa aac aaa cag aaa cta 432
Val Asn Lys Pro Ser Asn Asp Pro Lys Gln Lys Asn Lys Gln Lys Leu
130 135 140

atg ata agt gaa cta ctt tta caa cag ctt tta aaa ttg gat gga gtt 480
Met Ile Ser Glu Leu Leu Leu Gln Gln Leu Leu Lys Leu Asp Gly Val
145 150 155 160

gac gta ctg ggc agc gag aaa ttg cgt ttt gaa cgg aag caa ctt gtt 528
Asp Val Leu Gly Ser Glu Lys Leu Arg Phe Glu Arg Lys Gln Leu Val
165 170 175

tct aag atc caa aaa atg ttg gat cac gtt gac caa aca agc caa gaa 576
Ser Lys Ile Gln Lys Met Leu Asp His Val Asp Gln Thr Ser Gln Glu
180 185 190

gtg gcc gca tag 588
Val Ala Ala
195



<210> 16 
<211> 195 
<212> PRT 

<213> Schizosaccharomyces pombe
<400> 16 

Met Ser Glu Lys Thr Ser Thr Val Thr Ile His Tyr Gly Asn Gln Arg
1 5 10 15

Phe Pro Val Ala Val Asn Leu Asn Glu Thr Leu Ser Glu Leu Ile Asp
20 25 30

Asp Leu Leu Glu Thr Thr Glu Ile Ser Glu Lys Lys Val Lys Leu Phe
35 40 45

Tyr Ala Gly Lys Arg Leu Lys Asp Lys Lys Ala Ser Leu Ser Lys Leu
50 55 60

Gly Leu Lys Asn His Ser Lys Ile Leu Cys Ile Arg Pro His Lys Gln
65 70 75 80

Gln Arg Gly Ser Lys Glu Lys Asp Thr Val Glu Pro Ala Pro Lys Ala
85 90 95

Glu Ala Glu Asn Pro Val Phe Ser Arg Ile Ser Gly Glu Ile Lys Ala
100 105 110

Ile Asp Gln Tyr Val Asp Lys Glu Leu Ser Pro Met Tyr Asp Asn Tyr
115 120 125

Val Asn Lys Pro Ser Asn Asp Pro Lys Gln Lys Asn Lys Gln Lys Leu
130 135 140

Met Ile Ser Glu Leu Leu Leu Gln Gln Leu Leu Lys Leu Asp Gly Val
145 150 155 160

Asp Val Leu Gly Ser Glu Lys Leu Arg Phe Glu Arg Lys Gln Leu Val
165 170 175

Ser Lys Ile Gln Lys Met Leu Asp His Val Asp Gln Thr Ser Gln Glu
180 185 190

Val Ala Ala
195



<210> 17 

<211> 621 

<212> DNA

<213> Schizosaccharomyces pombe

<220>

<221> CDS

<222> (1)..(621)

<400> 17 

atg tct ttt ttt acc cag ttg tgt tct atg gat aaa aaa tat tgg atc 48
Met Ser Phe Phe Thr Gln Leu Cys Ser Met Asp Lys Lys Tyr Trp Ile
1 5 10 15

tct cta gct gta ttg tca gtt act gtt ttg att agc gca tta ttg aaa 96
Ser Leu Ala Val Leu Ser Val Thr Val Leu Ile Ser Ala Leu Leu Lys
20 25 30

aag aga gct act gaa acc gaa gat att gtc gtt gtt cat tac gat ggc 144
Lys Arg Ala Thr Glu Thr Glu Asp Ile Val Val Val His Tyr Asp Gly
35 40 45

gaa aag ttg aat ttt gtg ttg cga caa cca agg ctg aat atg gtt tct 192
Glu Lys Leu Asn Phe Val Leu Arg Gln Pro Arg Leu Asn Met Val Ser
50 55 60

tac act agt ttt ctt cgt cgc gtg tgc aac gca ttt tca gta atg ccc 240
Tyr Thr Ser Phe Leu Arg Arg Val Cys Asn Ala Phe Ser Val Met Pro
65 70 75 80

gac aaa gcg tct ctc aag tta aac ggg gtg acc ctc aag gat ggt tca 288
Asp Lys Ala Ser Leu Lys Leu Asn Gly Val Thr Leu Lys Asp Gly Ser
85 90 95

ctt tcc gac caa aat gtg caa aat gga agt gaa tta gag ctc gaa tta 336
Leu Ser Asp Gln Asn Val Gln Asn Gly Ser Glu Leu Glu Leu Glu Leu
100 105 110

ccc aaa ctg agc ccg gca atg caa caa att gaa gca tat ata gat gag 384
Pro Lys Leu Ser Pro Ala Met Gln Gln Ile Glu Ala Tyr Ile Asp Glu
115 120 125

ctt caa cag gat ctc gtc cct aaa att gaa gcc ttc tgc caa tcg tct 432
Leu Gln Gln Asp Leu Val Pro Lys Ile Glu Ala Phe Cys Gln Ser Ser
130 135 140

ccc gct tcg gca caa gat gtt caa gat ttg cat aca cgc ctt agt gaa 480
Pro Ala Ser Ala Gln Asp Val Gln Asp Leu His Thr Arg Leu Ser Glu
145 150 155 160

aca ttg ttg gct agg atg ata aaa tta gat gct gtt aat gtt gaa gac 528
Thr Leu Leu Ala Arg Met Ile Lys Leu Asp Ala Val Asn Val Glu Asp
165 170 175

gac cca gaa gct cgt ctt aaa aga aaa gaa gct att cgt tta tct caa 576
Asp Pro Glu Ala Arg Leu Lys Arg Lys Glu Ala Ile Arg Leu Ser Gln
180 185 190

caa tat ttg agt aaa cta gat tcc acc aag aat caa aac aaa tga 621
Gln Tyr Leu Ser Lys Leu Asp Ser Thr Lys Asn Gln Asn Lys
195 200 205



<210> 18 
<211> 206 
<212> PRT 

<213> Schizosaccharomyces pombe 
<400> 18 

Met Ser Phe Phe Thr Gln Leu Cys Ser Met Asp Lys Lys Tyr Trp Ile
1 5 10 15

Ser Leu Ala Val Leu Ser Val Thr Val Leu Ile Ser Ala Leu Leu Lys
20 25 30

Lys Arg Ala Thr Glu Thr Glu Asp Ile Val Val Val His Tyr Asp Gly
35 40 45

Glu Lys Leu Asn Phe Val Leu Arg Gln Pro Arg Leu Asn Met Val Ser
50 55 60

Tyr Thr Ser Phe Leu Arg Arg Val Cys Asn Ala Phe Ser Val Met Pro
65 70 75 80

Asp Lys Ala Ser Leu Lys Leu Asn Gly Val Thr Leu Lys Asp Gly Ser
85 90 95

Leu Ser Asp Gln Asn Val Gln Asn Gly Ser Glu Leu Glu Leu Glu Leu
100 105 110

Pro Lys Leu Ser Pro Ala Met Gln Gln Ile Glu Ala Tyr Ile Asp Glu
115 120 125

Leu Gln Gln Asp Leu Val Pro Lys Ile Glu Ala Phe Cys Gln Ser Ser
130 135 140

Pro Ala Ser Ala Gln Asp Val Gln Asp Leu His Thr Arg Leu Ser Glu
145 150 155 160

Thr Leu Leu Ala Arg Met Ile Lys Leu Asp Ala Val Asn Val Glu Asp
165 170 175

Asp Pro Glu Ala Arg Leu Lys Arg Lys Glu Ala Ile Arg Leu Ser Gln
180 185 190

Gln Tyr Leu Ser Lys Leu Asp Ser Thr Lys Asn Gln Asn Lys
195 200 205



<210> 19 

<211> 2534 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS

<222> (307)..(2034)
<400> 19

gcggagctcc gcatccaacc ccgggccgcg gccaacttct ctggactgga ccagaagttt 60

ctagccggcc agttgctacc tccctttatc tcctccttcc cctctggcag cgaggaggct 120

atttccagac acttccaccc ctctctggcc acgtcacccc cgcctttaat tcataaaggt 180

gcccggcgcc ggcttcccgg acacgtcggc ggcggagagg ggcccacggc ggcggcccgg 240

ccagagactc ggcgcccgga gccagcgccc cgcacccgcg ccccagcggg cagaccccaa 300

cccagc atg agc gcc gcc acc cac tcg ccc atg atg cag gtg gcg tcc 348
Met Ser Ala Ala Thr His Ser Pro Met Met Gln Val Ala Ser
1 5 10

ggc aac ggt gac cgc gac cct ttg ccc ccc gga tgg gag atc aag atc 396
Gly Asn Gly Asp Arg Asp Pro Leu Pro Pro Gly Trp Glu Ile Lys Ile
15 20 25 30

gac ccg cag acc ggc tgg ccc ttc ttc gtg gac cac aac agc cgc acc 444
Asp Pro Gln Thr Gly Trp Pro Phe Phe Val Asp His Asn Ser Arg Thr
35 40 45

act acg tgg aac gac ccg cgc gtg ccc tct gag ggc ccc aag gag act 492
Thr Thr Trp Asn Asp Pro Arg Val Pro Ser Glu Gly Pro Lys Glu Thr
50 55 60

cca tcc tct gcc aat ggc cct tcc cgg gag ggc tct agg ctg ccg cct 540
Pro Ser Ser Ala Asn Gly Pro Ser Arg Glu Gly Ser Arg Leu Pro Pro
65 70 75

gct agg gaa ggc cac cct gtg tac ccc cag ctc cga cca ggc tac att 588
Ala Arg Glu Gly His Pro Val Tyr Pro Gln Leu Arg Pro Gly Tyr Ile
80 85 90

ccc att cct gtg ctc cat gaa ggc gct gag aac cgg cag gtg cac cct 636
Pro Ile Pro Val Leu His Glu Gly Ala Glu Asn Arg Gln Val His Pro
95 100 105 110

ttc cat gtc tat ccc cag cct ggg atg cag cga ttc cga act gag gcg 684
Phe His Val Tyr Pro Gln Pro Gly Met Gln Arg Phe Arg Thr Glu Ala
115 120 125

gca gca gcg gct cct cag agg tcc cag tca cct ctg cgg ggc atg cca 732
Ala Ala Ala Ala Pro Gln Arg Ser Gln Ser Pro Leu Arg Gly Met Pro
130 135 140

gaa acc act cag cca gat aaa cag tgt gga cag gtg gca gcg gcg gcg 780
Glu Thr Thr Gln Pro Asp Lys Gln Cys Gly Gln Val Ala Ala Ala Ala
145 150 155

gca gcc cag ccc cca gcc tcc cac gga cct gag cgg tcc cag tct cca 828
Ala Ala Gln Pro Pro Ala Ser His Gly Pro Glu Arg Ser Gln Ser Pro
160 165 170

gct gcc tct gac tgc tca tcc tca tcc tcc tcg gcc agc ctg cct tcc 876
Ala Ala Ser Asp Cys Ser Ser Ser Ser Ser Ser Ala Ser Leu Pro Ser
175 180 185 190

tcc ggc agg agc agc ctg ggc agt cac cag ctc ccg cgg ggg tac atc 924
Ser Gly Arg Ser Ser Leu Gly Ser His Gln Leu Pro Arg Gly Tyr Ile
195 200 205

tcc att ccg gtg ata cac gag cag aac gtt acc cgg cca gca gcc cag 972
Ser Ile Pro Val Ile His Glu Gln Asn Val Thr Arg Pro Ala Ala Gln
210 215 220

ccc tcc ttc cac aaa gcc cag aag acg cac tac cca gcg cag agg ggt 1020
Pro Ser Phe His Lys Ala Gln Lys Thr His Tyr Pro Ala Gln Arg Gly
225 230 235

gag tac cag acc cac cag cct gtg tac cac aag atc cag ggg gat gac 1068
Glu Tyr Gln Thr His Gln Pro Val Tyr His Lys Ile Gln Gly Asp Asp
240 245 250

tgg gag ccc cgg ccc ctg cgg gcg gca tcc ccg ttc agg tca tct gtc 1116
Trp Glu Pro Arg Pro Leu Arg Ala Ala Ser Pro Phe Arg Ser Ser Val
255 260 265 270

cag ggt gca tcg agc cgg gag ggc tca cca gcc agg agc agc acg cca 1164
Gln Gly Ala Ser Ser Arg Glu Gly Ser Pro Ala Arg Ser Ser Thr Pro
275 280 285

ctc cac tcc ccc tcg ccc atc cgt gtg cac acc gtg gtc gac agg cct 1212
Leu His Ser Pro Ser Pro Ile Arg Val His Thr Val Val Asp Arg Pro
290 295 300

cag cag ccc atg acc cat cga gaa act gca cct gtt tcc cag cct gaa 1260
Gln Gln Pro Met Thr His Arg Glu Thr Ala Pro Val Ser Gln Pro Glu
305 310 315

aac aaa cca gaa agt aag cca ggc cca gtt gga cca gaa ctc cct cct 1308
Asn Lys Pro Glu Ser Lys Pro Gly Pro Val Gly Pro Glu Leu Pro Pro
320 325 330

gga cac atc cca att caa gtg atc cgc aaa gag gtg gat tct aaa cct 1356
Gly His Ile Pro Ile Gln Val Ile Arg Lys Glu Val Asp Ser Lys Pro
335 340 345 350

gtt tcc cag aag ccc cca cct ccc tct gag aag gta gag gtg aaa gtt 1404
Val Ser Gln Lys Pro Pro Pro Pro Ser Glu Lys Val Glu Val Lys Val
355 360 365

ccc cct gct cca gtt cct tgt cct cct ccc agc cct ggc cct tct gct 1452
Pro Pro Ala Pro Val Pro Cys Pro Pro Pro Ser Pro Gly Pro Ser Ala
370 375 380

gtc ccc tct tcc ccc aag agt gtg gct aca gaa gag agg gca gcc ccc 1500
Val Pro Ser Ser Pro Lys Ser Val Ala Thr Glu Glu Arg Ala Ala Pro
385 390 395

agc act gcc cct gca gaa gct aca cct cca aaa cca gga gaa gcc gag 1548
Ser Thr Ala Pro Ala Glu Ala Thr Pro Pro Lys Pro Gly Glu Ala Glu
400 405 410




gct ccc cca aaa cat cca gga gtg ctg aaa gtg gaa gcc atc ctg gag 1596
Ala Pro Pro Lys His Pro Gly Val Leu Lys Val Glu Ala Ile Leu Glu
415 420 425 430

aag gtg cag ggg ctg gag cag gct gta gac aac ttt gaa ggc aag aag 1644
Lys Val Gln Gly Leu Glu Gln Ala Val Asp Asn Phe Glu Gly Lys Lys
435 440 445

act gac aaa aag tac ctg atg atc gaa gag tat ttg ac: aaa gag ctg 1692
Thr Asp Lys Lys Tyr Leu Met Ile Glu Glu Tyr Leu Thr Lys Glu Leu
450 455 460

ctg gcc ctg gat tca gtg gac ccc gag gga cga gcc gat gtg cgt cag 1740
Leu Ala Leu Asp Ser Val Asp Pro Glu Gly Arg Ala Asp Val Arg Gln
465 470 475

gcc agg aga gac ggt gtc agg aag gtt cag acc atc ttg gaa aaa ctt 1788
Ala Arg Arg Asp Gly Val Arg Lys Val Gln Thr Ile Leu Glu Lys Leu
480 485 490

gaa cag aaa gcc att gat gtc cca ggt caa gtc cag gtc tat gaa ctc 1836
Glu Gln Lys Ala Ile Asp Val Pro Gly Gln Val Gln Val Tyr Glu Leu
495 500 505 510

cag ccc agc aac ctt gaa gca gat cag cca ctg cag gca atc atg gag 1884
Gln Pro Ser Asn Leu Glu Ala Asp Gln Pro Leu Gln Ala Ile Met Glu
515 520 525

atg ggt gcc gtg gca gca gac aag ggc aag aaa aat gct gga aat gca 1932
Met Gly Ala Val Ala Ala Asp Lys Gly Lys Lys Asn Ala Gly Asn Ala
530 535 540

gaa gat ccc cac aca gaa acc cag cag cca gaa gcc aca gca gca gcg 1980
Glu Asp Pro His Thr Glu Thr Gln Gln Pro Glu Ala Thr Ala Ala Ala
545 550 555

act tca aac ccc agc agc atg aca gac acc cct ggt aac cca gca gca 2028
Thr Ser Asn Pro Ser Ser Met Thr Asp Thr Pro Gly Asn Pro Ala Ala
560 565 570

ccg tag cctctgccct gtaaaaatca gactcggaac cgatgtgtgc tttagggaat 2084
Pro
575

tttaagttgc atgcatttca gagactttaa gtcagttggt ttttattagc tgcttggtat 2144

gcagtaactt gggtggaggc aaaacactaa taaaagggct aaaaaggaaa atgatgcttt 2204

tcttctatat tc:ta::ctg tacaaataaa gaagttgctt gttgtttgag aagtttaacc 2264

ccgttgcttg ttctgcagcc ctgtctactt aggcaccccc accacctgtt acctgtggtt 2324

gcgcactgtc ttttgtagct ctggactgga ggggtagatg gggagtcaat tacccatcac 2384

atiaaatanga aacatttatc agaaatgttg ccatttiaat gagatgattt tcttcatctc 2444

auattaaaa tacctgactt tagagagagt aaaatgtgcc aggagccata ggaatatctg 2504

tatgttggat gactttaatg ctacattttc 2534



<210> 20 

<211> 575

<212> PRT

<213> Homo sapiens 

<400> 20 

Met Ser Ala Ala Thr His Ser Pro Met Met Gln Val Ala Ser Gly Asn
1 5 10 15

Gly Asp Arg Asp Pro Leu Pro Pro Gly Trp Glu Ile Lys Ile Asp Pro
20 25 30

Gln Thr Gly Trp Pro Phe Phe Val Asp His Asn Ser Arg Thr Thr Thr
35 40 45

Trp Asn Asp Pro Arg Val Pro Ser Glu Gly Pro Lys Glu Thr Pro Ser
50 55 60

Ser Ala Asn Gly Pro Ser Arg Glu Gly Ser Arg Leu Pro Pro Ala Arg
65 70 75 80

Glu Gly His Pro Val Tyr Pro Gln Leu Arg Pro Gly Tyr Ile Pro Ile
85 90 95

Pro Val Leu His Glu Gly Ala Glu Asn Arg Gln Val His Pro Phe His
100 105 110

Val Tyr Pro Gln Pro Gly Met Gln Arg Phe Arg Thr Glu Ala Ala Ala
115 120 125

Ala Ala Pro Gln Arg Ser Gln Ser Pro Leu Arg Gly Met Pro Glu Thr
130 135 140

Thr Gln Pro Asp Lys Gln Cys Gly Gln Val Ala Ala Ala Ala Ala Ala
145 150 155 160

Gln Pro Pro Ala Ser His Gly Pro Glu Arg Ser Gln Ser Pro Ala Ala
165 170 175

Ser Asp Cys Ser Ser Ser Ser Ser Ser Ala Ser Leu Pro Ser Ser Gly
180 185 190

Arg Ser Ser Leu Gly Ser His Gln Leu Pro Arg Gly Tyr Ile Ser Ile
195 200 205

Pro Val Ile His Glu Gln Asn Val Thr Arg Pro Ala Ala Gln Pro Ser
210 215 220

Phe His Lys Ala Gln Lys Thr His Tyr Pro Ala Gln Arg Gly Glu Tyr
225 230 235 240

Gln Thr His Gln Pro Val Tyr His Lys Ile Gln Gly Asp Asp Trp Glu
245 250 255

Pro Arg Pro Leu Arg Ala Ala Ser Pro Phe Arg Ser Ser Val Gln Gly
260 265 270

Ala Ser Ser Arg Glu Gly Ser Pro Ala Arg Ser Ser Thr Pro Leu His
275 280 285

Ser Pro Ser Pro Ile Arg Val His Thr Val Val Asp Arg Pro Gln Gln
290 295 300

Pro Met Thr His Arg Glu Thr Ala Pro Val Ser Gln Pro Glu Asn Lys
305 310 315 320

Pro Glu Ser Lys Pro Gly Pro Val Gly Pro Glu Leu Pro Pro Gly His
325 330 335

Ile Pro Ile Gln Val Ile Arg Lys Glu Val Asp Ser Lys Pro Val Ser
340 345 350

Gln Lys Pro Pro Pro Pro Ser Glu Lys Val Glu Val Lys Val Pro Pro
355 360 365

Ala Pro Val Pro Cys Pro Pro Pro Ser Pro Gly Pro Ser Ala Val Pro
370 375 380

Ser Ser Pro Lys Ser Val Ala Thr Glu Glu Arg Ala Ala Pro Ser Thr
385 390 395 400

Ala Pro Ala Glu Ala Thr Pro Pro Lys Pro Gly Glu Ala Glu Ala Pro
405 410 415

Pro Lys His Pro Gly Val Leu Lys Val Glu Ala Ile Leu Glu Lys Val
420 425 430

Gln Gly Leu Glu Gln Ala Val Asp Asn Phe Glu Gly Lys Lys Thr Asp
435 440 445

Lys Lys Tyr Leu Met Ile Glu Glu Tyr Leu Thr Lys Glu Leu Leu Ala
450 455 460

Leu Asp Ser Val Asp Pro Glu Gly Arg Ala Asp Val Arg Gln Ala Arg
465 470 475 480

Arg Asp Gly Val Arg Lys Val Gln Thr Ile Leu Glu Lys Leu Glu Gln
485 490 495

Lys Ala Ile Asp Val Pro Gly Gln Val Gln Val Tyr Glu Leu Gln Pro
500 505 510

Ser Asn Leu Glu Ala Asp Gln Pro Leu Gln Ala Ile Met Glu Met Gly
515 520 525

Ala Val Ala Ala Asp Lys Gly Lys Lys Asn Ala Gly Asn Ala Glu Asp
530 535 540

Pro His Thr Glu Thr Gln Gln Pro Glu Ala Thr Ala Ala Ala Thr Ser
545 550 555 560

Asn Pro Ser Ser Met Thr Asp Thr Pro Gly Asn Pro Ala Ala Pro
565 570 575



<210> 21 

<211> 1966 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (43)..(1416)

<400> 21 

cggtgggagc ggggcgggaa gcgcttcagg gcagcggatc cc atg tcg gcc ctg 54
Met Ser Ala Leu
1

agg cgc tcg ggc tac ggc ccc agt gac ggt ccg tcc tac ggc cgc tac 102
Arg Arg Ser Gly Tyr Gly Pro Ser Asp Gly Pro Ser Tyr Gly Arg Tyr
5 10 15 20

tac ggg cct ggg ggt gga gat gtg ccg gta cac cca ccn cca ccc tta 150
Tyr Gly Pro Gly Gly Gly Asp Val Pro Val His Pro Pro Pro Pro Leu
25 30 35

tat cct ctt cgc cct gaa cct ccc cag cct ccc att tcc tgg cgg gtg 198
Tyr Pro Leu Arg Pro Glu Pro Pro Gln Pro Pro Ile Ser Trp Arg Val
40 45 50

cgc ggg ggc ggc ccg gcg gag acc acc tgg ctg gga gaa ggc gga gga 246
Arg Gly Gly Gly Pro Ala Glu Thr Thr Trp Leu Gly Glu Gly Gly Gly
55 60 65

ggc gat ggc tac tat ccc tcg gga ggc gcc tgg cca gag cct ggt cga 294
Gly Asp Gly Tyr Tyr Pro Ser Gly Gly Ala Trp Pro Glu Pro Gly Arg
70 75 80

gcc gga gga agc cac cag gag cag cca cca tat cct agc tac aat tct 342
Ala Gly Gly Ser His Gln Glu Gln Pro Pro Tyr Pro Ser Tyr Asn Ser
85 90 95 100

aac tat tgg aat tct act gcg aga tct agg gct cct tac cca agt aca 390
Asn Tyr Trp Asn Ser Thr Ala Arg Ser Arg Ala Pro Tyr Pro Ser Thr
105 110 115

tat cct gta aga cca gaa ttg caa ggc cag agt ttg aat tct tat aca 438
Tyr Pro Val Arg Pro Glu Leu Gln Gly Gln Ser Leu Asn Ser Tyr Thr
120 125 130

aat gga gcg tat ggt cca aca tac ccc cca ggc cct ggg gca aat act 486
Asn Gly Ala Tyr Gly Pro Thr Tyr Pro Pro Gly Pro Gly Ala Asn Thr
135 140 145

gcc tca tac tca ggg gct tat tat gca cct ggt tat act cag acc agt 534
Ala Ser Tyr Ser Gly Ala Tyr Tyr Ala Pro Gly Tyr Thr Gln Thr Ser
150 155 160

tac tcc aca gaa gtt cca agt act tac cgt tca tct ggc aac agc cca 582
Tyr Ser Thr Glu Val Pro Ser Thr Tyr Arg Ser Ser Gly Asn Ser Pro
165 170 175 180

act cca gtc tct cgt tgg atc tat ccc cag cag gac tgt cag act gaa 630
Thr Pro Val Ser Arg Trp Ile Tyr Pro Gln Gln Asp Cys Gln Thr Glu
185 190 195

gca ccc cct ctt agg ggg cag gtt cca gga tat ccg cct tca cag aac 678
Ala Pro Pro Leu Arg Gly Gln Val Pro Gly Tyr Pro Pro Ser Gln Asn
200 205 210

cct gga atg acc ctg ccc cat tat cct tat gga gat ggt aat cgt agt 726
Pro Gly Met Thr Leu Pro His Tyr Pro Tyr Gly Asp Gly Asn Arg Ser
215 220 225

gtt cca caa tca gga ccg act gta cga cca caa gaa gat gcg tgg gct 774
Val Pro Gln Ser Gly Pro Thr Val Arg Pro Gln Glu Asp Ala Trp Ala
230 235 240

tct cct ggt gct tat gga atg ggt ggc cgt tat ccc tgg cct tca tca 822
Ser Pro Gly Ala Tyr Gly Met Gly Gly Arg Tyr Pro Trp Pro Ser Ser
245 250 255 260

gcg ccc tca gca cca ccc ggc aat ctc tac atg act gaa agt act tca 870
Ala Pro Ser Ala Pro Pro Gly Asn Leu Tyr Met Thr Glu Ser Thr Ser
265 270 275

cca tgg cct agc agt ggc tct ccc cag tca ccc cct tca ccc cca gtc 918
Pro Trp Pro Ser Ser Gly Ser Pro Gln Ser Pro Pro Ser Pro Pro Val
280 285 290

cag cag ccc aag gat tct tca tac ccc tat agc caa tca gat caa agc 966
Gln Gln Pro Lys Asp Ser Ser Tyr Pro Tyr Ser Gln Ser Asp Gln Ser
295 300 305

atg aac cgg cac aac ttt cct tgc agt gtc cat cag tac gaa tcc tcg 1014
Met Asn Arg His Asn Phe Pro Cys Ser Val His Gln Tyr Glu Ser Ser
310 315 320

ggg aca gtg atc aat gaa gat tca gat ctt ttg gat tcc caa gtc cag 1062
Gly Thr Val Ile Asn Glu Asp Ser Asp Leu Leu Asp Ser Gln Val Gln
325 330 335 340

tat agt gct gag cct cag ctg tat ggt aat gcc acc agt gac cat ccc 1110
Tyr Ser Ala Glu Pro Gln Leu Tyr Gly Asn Ala Thr Ser Asp His Pro
345 350 355

aac aat caa gat caa agt agc agt ctt cct gaa gaa tgt gta cct tca 1158
Asn Asn Gln Asp Gln Ser Ser Ser Leu Pro Glu Glu Cys Val Pro Ser
360 365 370

gat gaa agt act cct ccg agt att aaa aaa atc ata cat gtg ctg gag 1206
Asp Glu Ser Thr Pro Pro Ser Ile Lys Lys Ile Ile His Val Leu Glu
375 380 385

aag gtc cag tat ctt gaa caa gaa gta gaa gaa ttt gta gga aaa aag 1254
Lys Val Gln Tyr Leu Glu Gln Glu Val Glu Glu Phe Val Gly Lys Lys
390 395 400

aca gac aaa gca tac tgg ctt ctg gaa gaa atg cta acc aag gaa ctt 1302
Thr Asp Lys Ala Tyr Trp Leu Leu Glu Glu Met Leu Thr Lys Glu Leu
405 410 415 420

ttg gaa ctg gat tca gtt gaa act ggg ggc cag gac tct gta cgg cag 1350
Leu Glu Leu Asp Ser Val Glu Thr Gly Gly Gln Asp Ser Val Arg Gln
425 430 435

gcc aga aaa gag gct gtt tgt aag att cag gcc ata ctg gaa aaa tta 1398
Ala Arg Lys Glu Ala Val Cys Lys Ile Gln Ala Ile Leu Glu Lys Leu
440 445 450

gaa aaa aaa gga tta tga aaggatttag aacaaagtgg aagcctgtta 1446
Glu Lys Lys Gly Leu
455

ctaacttgac caaagaacac ttgattaggt taattaccct ctttttgaaa tgcctgttga 1506

tgacaagaag caatacattc cagcttttcc tttgatttta tacttgaaaa actggcaaag 1566

gaatggaaga atattttagt catgaagttg ttttcagttt tcagacgaat gaatgtaata 1626

ggaaactatg gagttaccaa tattgccaag tagactcact ccttaaaaaa tttatggata 1686

tctacaagct gcttattac: agcaggaggg aaacacactt cacacaacag gcttatcaga 1746

aacctaccag atgaaactgg atataatttg agacaaacag gatgtgtttt tttaaacatc 1806

tggatatctt gtcacatttt tgtacattgt gactgctttc aacatatact tcatgtgtaa 1866

ttatagctta gactttagcc ttcttggact tctgttttgt tttgttattt gcagtttaca 1926

aatatagtat tattctctaa aaaaaaaaaa aaaaaaaaaa 1966



<210> 22 
<211> 457 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Met Ser Ala Leu Arg Arg Ser Gly Tyr Gly Pro Ser Asp Gly Pro Ser
1 5 10 15

Tyr Gly Arg Tyr Tyr Gly Pro Gly Gly Gly Asp Val Pro Val His Pro
20 25 30

Pro Pro Pro Leu Tyr Pro Leu Arg Pro Glu Pro Pro Gln Pro Pro Ile
35 40 45

Ser Trp Arg Val Arg Gly Gly Gly Pro Ala Glu Thr Thr Trp Leu Gly
50 55 60

Glu Gly Gly Gly Gly Asp Gly Tyr Tyr Pro Ser Gly Gly Ala Trp Pro
65 70 75 80

Glu Pro Gly Arg Ala Gly Gly Ser His Gln Glu Gln Pro Pro Tyr Pro
85 90 95

Ser Tyr Asn Ser Asn Tyr Trp Asn Ser Thr Ala Arg Ser Arg Ala Pro
100 105 110

Tyr Pro Ser Thr Tyr Pro Val Arg Pro Glu Leu Gln Gly Gln Ser Leu
115 120 125

Asn Ser Tyr Thr Asn Gly Ala Tyr Gly Pro Thr Tyr Pro Pro Gly Pro
130 135 140

Gly Ala Asn Thr Ala Ser Tyr Ser Gly Ala Tyr Tyr Ala Pro Gly Tyr
145 150 155 160

Thr Gln Thr Ser Tyr Ser Thr Glu Val Pro Ser Thr Tyr Arg Ser Ser
165 170 175

Gly Asn Ser Pro Thr Pro Val Ser Arg Trp Ile Tyr Pro Gln Gln Asp
180 185 190

Cys Gln Thr Glu Ala Pro Pro Leu Arg Gly Gln Val Pro Gly Tyr Pro
195 200 205

Pro Ser Gln Asn Pro Gly Met Thr Leu Pro His Tyr Pro Tyr Gly Asp
210 215 220

Gly Asn Arg Ser Val Pro Gln Ser Gly Pro Thr Val Arg Pro Gln Glu
225 230 235 240

Asp Ala Trp Ala Ser Pro Gly Ala Tyr Gly Met Gly Gly Arg Tyr Pro
245 250 255

Trp Pro Ser Ser Ala Pro Ser Ala Pro Pro Gly Asn Leu Tyr Met Thr
260 265 270

Glu Ser Thr Ser Pro Trp Pro Ser Ser Gly Ser Pro Gln Ser Pro Pro
275 280 285

Ser Pro Pro Val Gln Gln Pro Lys Asp Ser Ser Tyr Pro Tyr Ser Gln
290 295 300

Ser Asp Gln Ser Met Asn Arg His Asn Phe Pro Cys Ser Val His Gln
305 310 315 320

Tyr Glu Ser Ser Gly Thr Val Ile Asn Glu Asp Ser Asp Leu Leu Asp
325 330 335

Ser Gln Val Gln Tyr Ser Ala Glu Pro Gln Leu Tyr Gly Asn Ala Thr
340 345 350

Ser Asp His Pro Asn Asn Gln Asp Gln Ser Ser Ser Leu Pro Glu Glu
355 360 365

Cys Val Pro Ser Asp Glu Ser Thr Pro Pro Ser Ile Lys Lys Ile Ile
370 375 380

His Val Leu Glu Lys Val Gln Tyr Leu Glu Gln Glu Val Glu Glu Phe
385 390 395 400

Val Gly Lys Lys Thr Asp Lys Ala Tyr Trp Leu Leu Glu Glu Met Leu
405 410 415

Thr Lys Glu Leu Leu Glu Leu Asp Ser Val Glu Thr Gly Gly Gln Asp
420 425 430

Ser Val Arg Gln Ala Arg Lys Glu Ala Val Cys Lys Ile Gln Ala Ile
435 440 445

Leu Glu Lys Leu Glu Lys Lys Gly Leu
450 455



<210> 23 

<211> 4308 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (247)..(1590)
<400> 23 

cccccccccc cccccccccc ccngaagacg cccggagcgg ctgctgcagc cagtagcggc 60

cccttcaccg gctgccccgc tcagacctag tcgggagggg tgcgaggcat gcagctgggg 120

gcccagctcc ggtgccgcac cccgtaaagg gctgatcttc cacctcgcca cctcagccac 180

gggacgccaa gaccgcatcc aattcagact tcttttggtg cttgrgaaac tgaacacaac 240

aaaagt atg gat atg gga aac caa cat cct tct att agt agg ctt cag 288
Met Asp Met Gly Asn Gln His Pro Ser Ile Ser Arg Leu Gln
1 5 10

gaa atc caa aag gaa gta aaa agt gta gaa cag caa gtt atc ggc ttc 336
Glu Ile Gln Lys Glu Val Lys Ser Val Glu Gln Gln Val Ile Gly Phe
15 20 25 30

agt ggt ctg tca gat gac aag aat tac aag aaa ctg gag agg att cta 384
Ser Gly Leu Ser Asp Asp Lys Asn Tyr Lys Lys Leu Glu Arg Ile Leu
35 40 45

aca aaa cag ctt ttt gaa ata gac tct gta gat act gaa gga aaa gga 432
Thr Lys Gln Leu Phe Glu Ile Asp Ser Val Asp Thr Glu Gly Lys Gly
50 55 60

gat att cag caa gct agg aag cgg gca gca cag gag aca gaa cgt ctt 480
Asp Ile Gln Gln Ala Arg Lys Arg Ala Ala Gln Glu Thr Glu Arg Leu
65 70 75

ctc aaa gag ttg gag cag aat gca aac cac cca cac cgg att gaa ata 528
Leu Lys Glu Leu Glu Gln Asn Ala Asn His Pro His Arg Ile Glu Ile
80 85 90

cag aac att ttt gag gaa gcc cag tcc ctc gtg aga gag aaa att gtg 576
Gln Asn Ile Phe Glu Glu Ala Gln Ser Leu Val Arg Glu Lys Ile Val
95 100 105 110

cca ttt tat aat gga ggc aac tgc gta act gat gag ttt gaa gaa ggc 624
Pro Phe Tyr Asn Gly Gly Asn Cys Val Thr Asp Glu Phe Glu Glu Gly
115 120 125

atc caa gat atc att ctg agg ctg aca cat gtt aaa act gga gga aaa 672
Ile Gln Asp Ile Ile Leu Arg Leu Thr His Val Lys Thr Gly Gly Lys
130 135 140

atc tcc ttg cgg aaa gca agg tat cac act tta acc aaa atc tgt gcg 720
Ile Ser Leu Arg Lys Ala Arg Tyr His Thr Leu Thr Lys Ile Cys Ala
145 150 155

gtg caa gag ata atc gaa gac tgc atg aaa aag cag cct tcc ctg ccg 768
Val Gln Glu Ile Ile Glu Asp Cys Met Lys Lys Gln Pro Ser Leu Pro
160 165 170

ctt tcc gag gat gca cat cct tcc gtt gcc aaa atc aac ttc gtg atg 816
Leu Ser Glu Asp Ala His Pro Ser Val Ala Lys Ile Asn Phe Val Met
175 180 185 190

tgt gag gtg aac aag gcc cga ggg gtc ctg att gca ctt ctg atg ggt 864
Cys Glu Val Asn Lys Ala Arg Gly Val Leu Ile Ala Leu Leu Met Gly
195 200 205

gtg aac aac aat gag acc tgc agg cac tta tcc tgt gtg ctc tcg ggg 912
Val Asn Asn Asn Glu Thr Cys Arg His Leu Ser Cys Val Leu Ser Gly
210 215 220

ctg atc gct gac ctg gat gct cta gat gtg tgc ggc cgg aca gaa atc 960
Leu Ile Ala Asp Leu Asp Ala Leu Asp Val Cys Gly Arg Thr Glu Ile
225 230 235

aga aat tat cgg agg gag gta gta gaa gat atc aac aaa tta ttg aaa 1008
Arg Asn Tyr Arg Arg Glu Val Val Glu Asp Ile Asn Lys Leu Leu Lys
240 245 250

tat ctg gat ttg gaa gag gaa gca gac aca act aaa gca ttt gac ctg 1056
Tyr Leu Asp Leu Glu Glu Glu Ala Asp Thr Thr Lys Ala Phe Asp Leu
255 260 265 270

aga cag aat cat tcc att tta aaa ata gaa aag gtc ctc aag aga atg 1104
Arg Gln Asn His Ser Ile Leu Lys Ile Glu Lys Val Leu Lys Arg Met
275 280 285

aga gaa ata aaa aat gaa ctt ctc caa gca caa aac cct tct gaa ttg 1152
Arg Glu Ile Lys Asn Glu Leu Leu Gln Ala Gln Asn Pro Ser Glu Leu
290 295 300

tac ctg agc tcc aaa aca gaa ttg cag ggt tta att gga cag ttg gat 1200
Tyr Leu Ser Ser Lys Thr Glu Leu Gln Gly Leu Ile Gly Gln Leu Asp
305 310 315

gag gta agt ctt gaa aaa aac ccc tgc atc cgg gaa gcc agg aga aga 1248
Glu Val Ser Leu Glu Lys Asn Pro Cys Ile Arg Glu Ala Arg Arg Arg
320 325 330

gca gtg atc gag gtg caa act ctg atc aca tat att gac ttg aag gag 1296
Ala Val Ile Glu Val Gln Thr Leu Ile Thr Tyr Ile Asp Leu Lys Glu
335 340 345 350

gcc ctt gag aaa aga aag ctg ttt gct tgt gag gag cac cca tcc cat 1344
Ala Leu Glu Lys Arg Lys Leu Phe Ala Cys Glu Glu His Pro Ser His
355 360 365

aaa gcc gtc egg aac gtc ctt gga aac teg tct gag atc cag gga gaa 1392
Lys Ala Val Trp Asn Val Leu Gly Asn Leu Ser Glu Ile Gln Gly Glu
370 375 380

gtt ctt tca ttt gat gga aat cga acc gat aag aac tac atc cgg ctg 1440
Val Leu Ser Phe Asp Gly Asn Arg Thr Asp Lys Asn Tyr Ile Arg Leu
385 390 395

gaa gag ctg ctc acc aag cag ctg cta gcc ctg gat gct gtt gat ccg 1488
Glu Glu Leu Leu Thr Lys Gln Leu Leu Ala Leu Asp Ala Val Asp Pro
400 405 410

cag gga gaa gag aag tgt aag gct gcc agg aaa caa gct gtg agg ctt 1536
Gln Gly Glu Glu Lys Cys Lys Ala Ala Arg Lys Gln Ala Val Arg Leu
415 420 425 430

gcg cag aat att ctc agc tat ctc gac ctg aaa tct gat gaa tgg gag 1584
Ala Gln Asn Ile Leu Ser Tyr Leu Asp Leu Lys Ser Asp Glu Trp Glu
435 440 445

tac tga aataccagag atctcacttt tgatactgtt ttgcacttca tatgtgcttc 1640
Tyr

tatgtataga gagctttcag ttcattgatt tatacgtgca tatttcagtc tcagtattta 1700

tgattgaagc aaattctatt cagtatctgc tgcttttgat gttgcaagac aaatatcatt 1760

acagcacgtt aacttttcca ttcggatcat tatctgtatg atgtggtgtg gtttgtttgg 1820

tttgtccttt tttttgcgtt tttaatcaga aaacaaaata gaggcagctt ttgtagattt 1880

taaatgggtt gtgcaagcat taaaatgcag gtctttcaga atctagaact aggcataacc 1940

ttacataata ctaggaaaat tatgagaaag gggaaatttt tggttaaata agagtaaggt 2000

tcaaacacaa gcagtacatg ttctgtttca ttatgctcga tagaaggctt ttttttcact 2060

tataaggcct gattggtcct acccagctta acggggtggg gtttttttgt ttgttcagac 2120

agtctgttct tttgtaaaca tttttagttg gaaaaacagc atctgcattt tccccatcct 2180

ctacgtttta gagaggaatc ttgtttttgt gtgcaacata agaaaattat gaaaactaat 2240

agccaaaaaa cctttgagat tgcattaaag agaagggata aaggaccagc aataatacct 2300

tgtaagttgc ttttgtttgt aaaatctgag cttatagttt tccttagtga gtaaattcat 2360

aaggatggga acatttaaat taagttaatg ggcctttaaa aaaaaaaaag gaaacactca 2420

tacctgtagt tggaggatga atactggaga cgggttiacca atgtcaggtit atactaaaac 2480

taaatcagaa agtctgaatg tagcacataa tggttctctt ctgttgtcca aggctgtaaa 2540

atggacagcc ttgtcacacc tccccggtgc tgttttacaa cgtgagggta gacgctgtca 2600

gtaacccaga gggaccaggc cttcctaggt tttctaggca gtcagctgtt aaccactcac 2660

ttagtaaatg tcataactac acctgctcca ggaccaatca gtgaaacctg ctcggaatta 2720

aaggcttcct ctgggtgcct gctgaacaac tgagctcatg tcatgggcat gtggtggttt 2780

ctctgttgcc tgaaagagcc attaaagtca gtcgtgcgtg aagcatctct cttctaaagg 2840

atgtgtattt ccataaatgc tttctgagga tccggtacaa aatgatttcc caaagttctg 2900

aagtgccttg agaacatgtg ggtccgagtg ttataacaga ctcctccccc gggtcacctt 2960

ttgcctggtc atcctgttag agtacatctr tggaaatcca gggtaatatt ctctttcaga 3020

gatgctcatt gtgtaactct gtgtagggag atagtcactt taaacagctc aaagtagcta 3080

gctaaaggag tagccttaaa tacctaaaag atgacagaag catagccctt aacaaatctt 3140

cagcttgtct ctcagtattt cccaatcatg aaaatccctt gctatgtctt tcctactaga 3200

aatgttctag aatcgctgga cggtggggtc agagggcagt cggtatttag gccgtgagct 3260

tcccatacta ctgcaggtcc aactcctggc aaccgcgggc tcaaggcagg tcattggaat 3320

ccacgttttg gccacagtag ttgtaggatt gcttttctgt atcataattt tagaatgctc 3380

ttaaaatctt gaggaagagt ttttattttt tatttatttt tgagatggag tctctgttgc 3440

ccaggctgca gtgcagtggt gccatctcag ctcactgcaa cctccacctc ccaggttcaa 3500

gcgattctcc tgcctcagcc acctgagtag ctgggagtac aggcatgtgg caccatgcct 3560

ggctaatttt tgtattttta atagagttga gatttcacca tgatggtcag gctggtctcg 3620

aactcctgac ctcgtgatcc gcccgcctcg gccccccaaa gtgctgggat taacgggtgt 3680

gagccacggc gcccagccca ggaagagttt ttaaattaga gctctgttta attataccac 3740

tgggaaatca tggttacgct tcaggcatat tcttccccag agtactactt acattttaaa 3800

tttcattttg taaagttaaa tgtcagcatt ccctttaaaa gtgtccattg ttctttgaaa 3860

gtagacgttt cagtcattct tttcaaacaa gtgtttgtgt accttttgcc aagctgtggg 3920

catcgtgtgt gagtacaggg tgctcagctc ttccaccgtc attttgaatt gttcacatgg 3980 

gtaattggtc atggaaatga tcagattgac cttgattgac tgtcaggcat ggctttgttt 4040 

ctagtttcaa tctgttctcg ttccttgtac cggattattc tactcctgca atgaaccctg 4100 

ttgacaccgg atttagctct tgtcggcctt cgtggggagc tgtttgtgtt aatatgagct 4160 

actgcatgta attcttaaac tgggcttgtc acattgtatt gtatttttgt gatctgtaat 4220 

gaaaagaatc tgtactgcaa gtaaaaccta ctccccaaaa atgtgtggct ttgggtctgc 4280 
attaaacgct gtagtccatg ttcatgcc 4308 



<210> 24 

<211> 447

<212> PRT 

<213> Homo sapiens 

<400> 24 

Met Asp Met Gly Asn Gln His Pro Ser Ile Ser Arg Leu Gln Glu Ile
1 5 10 15

Gln Lys Glu Val Lys Ser Val Glu Gln Gln Val Ile Gly Phe Ser Gly
20 25 30

Leu Ser Asp Asp Lys Asn Tyr Lys Lys Leu Glu Arg Ile Leu Thr Lys
35 40 45

Gln Leu Phe Glu Ile Asp Ser Val Asp Thr Glu Gly Lys Gly Asp Ile
50 55 60

Gln Gln Ala Arg Lys Arg Ala Ala Gln Glu Thr Glu Arg Leu Leu Lys
65 70 75 80

Glu Leu Glu Gln Asn Ala Asn His Pro His Arg Ile Glu Ile Gln Asn
85 90 95

Ile Phe Glu Glu Ala Gln Ser Leu Val Arg Glu Lys Ile Val Pro Phe
100 105 110

Tyr Asn Gly Gly Asn Cys Val Thr Asp Glu Phe Glu Glu Gly Ile Gln
115 120 125

Asp Ile Ile Leu Arg Leu Thr His Val Lys Thr Gly Gly Lys Ile Ser
130 135 140

Leu Arg Lys Ala Arg Tyr His Thr Leu Thr Lys Ile Cys Ala Val Gln
145 150 155 160

Glu Ile Ile Glu Asp Cys Met Lys Lys Gln Pro Ser Leu Pro Leu Ser
165 170 175

Glu Asp Ala His Pro Ser Val Ala Lys Ile Asn Phe Val Met Cys Glu
180 185 190

Val Asn Lys Ala Arg Gly Val Leu Ile Ala Leu Leu Met Gly Val Asn
195 200 205

Asn Asn Glu Thr Cys Arg His Leu Ser Cys Val Leu Ser Gly Leu Ile
210 215 220

Ala Asp Leu Asp Ala Leu Asp Val Cys Gly Arg Thr Glu Ile Arg Asn
225 230 235 240

Tyr Arg Arg Glu Val Val Glu Asp Ile Asn Lys Leu Leu Lys Tyr Leu
245 250 255

Asp Leu Glu Glu Glu Ala Asp Thr Thr Lys Ala Phe Asp Leu Arg Gln
260 265 270

Asn His Ser Ile Leu Lys Ile Glu Lys Val Leu Lys Arg Met Arg Glu
275 280 285

Ile Lys Asn Glu Leu Leu Gln Ala Gln Asn Pro Ser Glu Leu Tyr Leu
290 295 300

Ser Ser Lys Thr Glu Leu Gln Gly Leu Ile Gly Gln Leu Asp Glu Val
305 310 315 320

Ser Leu Glu Lys Asn Pro Cys Ile Arg Glu Ala Arg Arg Arg Ala Val
325 330 335

Ile Glu Val Gln Thr Leu Ile Thr Tyr Ile Asp Leu Lys Glu Ala Leu
340 345 350

Glu Lys Arg Lys Leu Phe Ala Cys Glu Glu His Pro Ser His Lys Ala
355 360 365

Val Trp Asn Val Leu Gly Asn Leu Ser Glu Ile Gln Gly Glu Val Leu
370 375 380

Ser Phe Asp Gly Asn Arg Thr Asp Lys Asn Tyr Ile Arg Leu Glu Glu
385 390 395 400

Leu Leu Thr Lys Gln Leu Leu Ala Leu Asp Ala Val Asp Pro Gln Gly
405 410 415

Glu Glu Lys Cys Lys Ala Ala Arg Lys Gln Ala Val Arg Leu Ala Gln
420 425 430

Asn Ile Leu Ser Tyr Leu Asp Leu Lys Ser Asp Glu Trp Glu Tyr
435 440 445

NOVEL BAG PROTEINS AND
NUCLEIC ACID MOLECULES ENCODING THEM

STATEMENT AS TO RIGHTS TO INVENTIONS MADE
UNDER FEDERALLY-SPONSORED RESEARCH AND DEVELOPMENT

This invention was made with government support
under grant number CA-67329 awarded by the National
Institutes of Health. The United States Government has 
certain rights in this invention. 

BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION

This invention relates generally to the fields of 
molecular biology and molecular medicine and more 
specifically to a novel family of proteins that can 
regulate protein folding. The functions of these proteins 
are potentially diverse, including promoting tumor cell
growth and metastasis. 

BACKGROUND INFORMATION 

The Hsc70/Hsp70-family of molecular chaperones
participates in protein folding reactions, controlling
protein bioactivity, degradation, complex
assembly/disassembly, and translocation across membranes.
These proteins interact with hydrophobic regions within
target proteins via a carboxyl (C)-terminal peptide binding
domain, with substrate binding and release being controlled
by the N-terminal ATP-binding domain of Hsc70/Hsp70.
Hsc70/Hsp70-assisted folding reactions are accomplished by
repeated cycles of peptide binding, refolding, and release,
which are coupled to ATP hydrolysis by the ATP-binding
domain (ATPase) of Hsc70/Hsp70 and by subsequent nucleotide
exchange. The chaperone activity of mammalian Hsc70/Hsp70
is regulated by partner proteins that either modulate the
peptide binding cycle or that target the actions of these
chaperones to specific proteins and subcellular
compartments. DnaJ-family proteins (Hdj-1/Hsp40; Hdj-2;
Hdj-3) stimulate the ATPase activity of Hsc70/Hsp70,
resulting in the ADP-bound state, which binds tightly to
peptide substrates. The Hip protein collaborates with
Hsc70/Hsp70 and DnaJ homologues in stimulating ATP
hydrolysis, and thus also stabilizes Hsc70/Hsp70 complexes
with substrate polypeptides, whereas the Hop protein may
provide co-chaperone functions through interactions with
the C-terminal peptide binding domain.

The Bcl-2 associated athanogene-1 (bag-1) is 
named from the Greek word athanos, which refers to
anti-cell death. BAG-1 was previously referred to as 
Bcl-2-associated protein-1 (BAP-1) in U.S. Patent No. 
5,539,094 issued July 23, 1996, which is incorporated 
herein by reference. In this earlier patent, BAG-1 is 
described as a portion of the human BAG-1 protein, absent 
the N-terminal amino acids 1 to 85. In addition, a human 
protein essentially identical to human BAG-1 was described 
by Zeiner and Gehring (Proc. Natl. Acad. Sci., USA
92:11465-11469 (1995)). Subsequent to the issuance of U.S.
Patent 5,539,094 the N-terminal amino acid sequence from 1
to 85 of human BAG-1 was reported. 

BAG-1 and its longer isoforms BAG-1M (Rap46) and
BAG-1L are recently described Hsc70/Hsp70-regulating
proteins. BAG-1 competes with Hip for binding to the
Hsc70/Hsp70 ATPase domain and promotes substrate release.
BAG-1 also reportedly stimulates Hsc70-mediated ATP
hydrolysis by accelerating ADP/ATP exchange, analogous to
the prokaryotic GrpE nucleotide exchange protein of the
bacterial Hsc70 homologue, DnaK. Gene transfection studies
indicate that BAG-1 proteins can influence a wide variety
of cellular phenotypes through their interactions with
Hsc70/Hsp70, including increasing resistance to apoptosis,
promoting cell proliferation, enhancing tumor cell
migration and metastasis, and altering transcriptional
activity of steroid hormone receptors.



Despite the notable progress in the art, there
remains an unmet need for the further identification and
isolation of additional homologous BAG protein species, and
the nucleic acid molecules and/or nucleotide sequences
that encode them. Such species would provide additional
means by which the identity and composition of the BAG
domain, that is, the portion of the protein that
influences or modulates protein folding, could be
identified. In addition, such species would be useful for
identifying agents that modulate apoptosis as candidates
for therapeutic agents, in particular, anticancer agents.
The present invention satisfies this need, as well as
providing substantial related advantages.



SUMMARY OF THE INVENTION 



The present invention provides a family of BAG-1
related proteins from humans [BAG-1L (SEQ ID NO:2), BAG-1
(beginning at residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID
NO:4), BAG-3 (SEQ ID NO:6) and (SEQ ID NO:20), BAG-4 (SEQ
ID NO:8) and (SEQ ID NO:22) and BAG-5 (SEQ ID NO:10) and
(SEQ ID NO:24)], the invertebrate C. elegans [BAG-1 (SEQ ID
NO:12), BAG-2 (SEQ ID NO:14)] and the fission yeast S. pombe
[BAG-1A (SEQ ID NO:16), BAG-1B (SEQ ID NO:18)] and the
nucleic acid molecules that encode them.

Another aspect of the present invention provides
an amino acid sequence present in the family of BAG-1
related proteins, that modulates Hsc70/Hsp70 chaperone
activity, that is, the BAG domain.

Another aspect of the present invention provides
novel polypeptide and nucleic acid compositions and methods
useful in modulating Hsc70/Hsp70 chaperone activity.

Another aspect of the present invention is
directed to methods for detecting agents that modulate the
binding of the BAG family of proteins, such as BAG-1
(beginning at residue 116 of SEQ ID NO:2), and related
proteins with the Hsc70/Hsp70 Family of proteins or with
other proteins that may interact with the BAG-Family
proteins.

Still another aspect of the present invention is
directed to methods for detecting agents that induce the
dissociation of a bound complex formed by the association
of BAG-Family proteins with Hsc70/Hsp70 Family molecular
chaperones or other proteins.

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows the full length cDNA sequence for
human BAG-1 (SEQ ID NO:1) protein with the corresponding
amino acid sequence (SEQ ID NO:2). Within the full length
sequence are included the overlapping sub-sequences of
BAG-1 (beginning at nucleotide 391), BAG-1M [beginning at
nucleotide 260 of (SEQ ID NO:2)], and BAG-1L [beginning at
nucleotide 46 of (SEQ ID NO:2)].

Figures 2A and 2B combined show the full length
cDNA sequence (SEQ ID NO:3) aligned with the corresponding
amino acid residues for human BAG-2 protein (SEQ ID NO:4).

Figure 3 shows a cDNA sequence (SEQ ID NO:5)
aligned with the corresponding amino acid residues for
human BAG-3 protein (SEQ ID NO:6).

Figure 4 shows a cDNA sequence (SEQ ID NO:7)
aligned with the corresponding amino acid residues for
human BAG-4 protein (SEQ ID NO:8).

Figure 5 shows a cDNA sequence (SEQ ID NO:9)
aligned with the corresponding amino acid residues for
human BAG-5 protein (SEQ ID NO:10).

Figure 6A shows the full length cDNA sequence for
C. elegans BAG-1 protein (SEQ ID NO:11).

Figure 6B shows the 210 amino acid sequence for
C. elegans BAG-1 protein (SEQ ID NO:12).

Figure 7A shows the full length cDNA sequence for
C. elegans BAG-2 protein (SEQ ID NO:13).

Figure 7B shows the 458 amino acid sequence for
C. elegans BAG-2 protein (SEQ ID NO:14).

Figure 8A shows the full length cDNA sequence for
S. pombe BAG-1A protein (SEQ ID NO:15).

Figure 8B shows the 195 amino acid sequence for
S. pombe BAG-1A protein (SEQ ID NO:16).

Figure 9A shows the full length cDNA sequence for
S. pombe BAG-1B protein (SEQ ID NO:17).

Figure 9B shows the 206 amino acid sequence for
S. pombe BAG-1B protein (SEQ ID NO:18).

Figure 10 shows the topologies of the BAG-family
proteins; human BAG proteins, BAG-1 (SEQ ID NO:2), BAG-2
(SEQ ID NO:4), BAG-3 (SEQ ID NO:6), BAG-4 (SEQ ID NO:8),
BAG-5 (SEQ ID NO:10); S. pombe BAG-1A (SEQ ID NO:16) and
BAG-1B (SEQ ID NO:18); and C. elegans BAG-1 (SEQ ID
NO:12) and BAG-2 (SEQ ID NO:14). (A) The relative
positions of the BAG domains are shown in black,
ubiquitin-like regions are represented in gray, WW domains
are represented in stripes. Nucleoplasmin-like nuclear
localization sequences are also shown. (B) The amino acid
sequences of the BAG domain for human BAG-1 (SEQ ID NO:2),
BAG-2 (SEQ ID NO:4), BAG-3 (SEQ ID NO:6), BAG-4 (SEQ ID
NO:8), BAG-5 (SEQ ID NO:10), S. pombe BAG-1A (SEQ ID
NO:16) and BAG-1B (SEQ ID NO:18), and C. elegans BAG-1 (SEQ
ID NO:12) and BAG-2 (SEQ ID NO:14) are aligned demonstrating
their homology. Black and gray shading represent identical
and similar amino acids, respectively.

Figure 11 shows assays demonstrating the
interaction of BAG-family proteins with Hsc70/ATPase. (A)
Two-hybrid assays using yeast expressing the indicated
fusion proteins. Blue color indicates a positive
interaction, resulting in activation of the lacZ reporter
gene. (B) In vitro protein assays using GST-fusion
proteins and 35S-labeled in vitro translated proteins. (C)
Co-immunoprecipitation assays using anti-Flag or IgG1
control antibodies and lysates from 293T cells expressing
Flag-tagged BAG-1 (beginning at residue 116 of SEQ ID
NO:2), BAG-2 (SEQ ID NO:4), BAG-3 (SEQ ID NO:6), Daxx, or
Apaf-1.

Figure 12 shows surface plasmon resonance
analysis of BAG-family protein interactions with
Hsc70/ATPase. (A) SDS-PAGE analysis of purified
recombinant proteins. (B) Representative SPR results of
biosensor chips containing immobilized BAG proteins with
and without maximally bound Hsc70/ATPase.

Figure 13 shows representative SPR results for
biosensor chips containing immobilized BAG-1 (beginning at
residue 116 of SEQ ID NO:2), BAG-1(ΔC), BAG-2 (SEQ ID NO:4),
or BAG-3 (SEQ ID NO:6) proteins. Hsc70/ATPase was flowed
over the chips (arrow/left) until maximal binding was
reached (response units), then flow was continued without
Hsc70/ATPase (arrow/right). For BAG-2 (SEQ ID NO:4) and
BAG-3 (SEQ ID NO:6), Hsc70 was injected at 0.0175, 0.035,
0.07, 0.14, and 0.28 µM.

Figure 14 shows BAG-family protein modulation of
Hsc70 chaperone activity. (A) Protein refolding assay of
chemically-denatured luciferase by Hsc70 plus DnaJ in the
absence or presence of BAG and BAG-mutant proteins. (B)
Concentration-dependent inhibition of Hsc70-mediated
protein refolding by BAG-family proteins [BAG-1 (beginning
at residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID NO:4), BAG-3
(SEQ ID NO:6)] but not by BAG-mutant [BAG-1(ΔC)]. (C)
Hsc70/Hsp40-mediated refolding of heat-denatured luciferase
was assayed in the presence (black bars) or absence
(striped bars) of 1.8 µM Hip, with (lanes 3-10) or without
(lanes 1, 2) various BAG-family proteins (1.8 µM) as
indicated (mean ± SE; n = 3). A control (CNTL) is shown (lane
1) in which Hsc70 was replaced with an equivalent amount of
BSA.

Figure 15A shows an expanded cDNA sequence for
human BAG-3 protein (SEQ ID NO:19).

Figure 15B shows the corresponding amino acid
residues for the human BAG-3 protein (SEQ ID NO:20) of
Figure 15A.

Figure 15C shows the expanded cDNA sequence (SEQ
ID NO:19) aligned with the corresponding amino acid
residues for human BAG-3 protein of Figure 15A (SEQ ID
NO:20).

Figure 16A shows an expanded cDNA sequence for
human BAG-4 protein (SEQ ID NO:21).

Figure 16B shows the corresponding amino acid
residues for the human BAG-4 protein of Figure 16A (SEQ ID
NO:22).

Figure 16C shows the expanded cDNA sequence (SEQ
ID NO:21) aligned with the corresponding amino acid
residues for human BAG-4 protein of Figure 16A (SEQ ID
NO:22).

Figure 17A shows an expanded cDNA sequence for
human BAG-5 protein (SEQ ID NO:23).

Figure 17B shows the corresponding amino acid
residues for the human BAG-5 protein of Figure 17A (SEQ ID
NO:24).

Figure 17C shows the expanded cDNA sequence (SEQ
ID NO:23) aligned with the corresponding amino acid
residues for human BAG-5 protein of Figure 17A (SEQ ID
NO:24).

Figure 18 shows the topologies of the BAG-family
proteins; human BAG proteins, BAG-1 (SEQ ID NO:2), BAG-2
(SEQ ID NO:4), expanded BAG-3 (SEQ ID NO:20), expanded
BAG-4 (SEQ ID NO:22), expanded BAG-5 (SEQ ID NO:24);
S. pombe BAG-1A (SEQ ID NO:16) and BAG-1B (SEQ ID NO:18); and
C. elegans BAG-1 (SEQ ID NO:12) and BAG-2 (SEQ ID NO:14).
The relative positions of the BAG domains are shown in
black, ubiquitin-like regions are represented in gray, WW
domains are represented in stripes. Nucleoplasmin-like
nuclear localization sequences are also shown.

Definitions 

The term "apoptosis", as used herein, refers to
the process of programmed cell death. Although not all
programmed cell deaths occur through apoptosis, as used
herein, "apoptosis" and "programmed cell death" are used
interchangeably.

The term "tumor cell proliferation", as used
herein, refers to the ability of tumor cells to grow and
thus expand a tumor mass.

The term "cell migration", as used herein, refers
to the role cell motility plays in the invasion and
potentially metastasis by tumor cells.

The term "metastasis", as used herein, refers to
the spread of a disease process from one part of the body
to another, as in the appearance of neoplasms in parts of
the body remote from the site of the primary tumor; results
in dissemination of tumor cells by the lymphatics or blood
vessels or by direct extension through serous cavities or
subarachnoid or other spaces.

The term "steroid hormone receptor function", as
used herein, refers to physiological, cellular and molecular
functioning of receptor sites that bind with steroid
hormones.

The term "substantially purified", as used
herein, refers to nucleic acid or amino acid sequences that
are removed from their natural environment, isolated or
separated, and are at least 60% free, preferably 75% free,
and most preferably 90% free from other components with
which they are naturally associated.

"Nucleic acid molecule" as used herein refers to
an oligonucleotide, nucleotide, or polynucleotide, and
fragments or portions thereof, and to DNA or RNA of genomic
or synthetic origin which may be single or double stranded,
and represent the sense or antisense strand.

"Hybridization", as used herein, refers to any 
process by which a strand of nucleic acid binds with a 
complementary strand through base pairing. 

The terms "complementary" or "complementarity",
as used herein, refer to the natural binding of
polynucleotides under permissive salt and temperature
conditions by base-pairing. For example, the sequence
"A-G-T" binds to the complementary sequence "T-C-A".
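The base-pairing example above can be illustrated with a minimal Python sketch. The function names and the hyphen-separated notation are illustrative assumptions that simply mirror the "A-G-T"/"T-C-A" example; the sketch pairs bases position by position, as the example is written, and does not model strand orientation.

```python
# Watson-Crick pairing rules for DNA bases.
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(seq: str) -> str:
    """Return the base-by-base complement of a hyphen-separated DNA sequence."""
    bases = seq.upper().split("-")
    return "-".join(PAIRS[base] for base in bases)


def is_complementary(first: str, second: str) -> bool:
    """True when every base of `first` pairs with the base at the same position in `second`."""
    return complement(first) == second.upper()


print(complement("A-G-T"))                 # T-C-A
print(is_complementary("A-G-T", "T-C-A"))  # True
```

A sequence such as "A-G-T" compared against "T-C-G" would return False here, which corresponds to the partial complementarity discussed under "homology".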

The term "homology", as used herein, refers to a
degree of complementarity. There may be partial homology
or complete homology (i.e., identity). A partially
complementary sequence is one that at least partially
inhibits an identical sequence from hybridizing to a target
nucleic acid and is referred to using the functional term
"substantially homologous." The inhibition of
hybridization of the completely complementary sequence to
the target sequence may be examined using a hybridization
assay (Southern or northern blot, solution hybridization
and the like) under conditions of low stringency. A
substantially homologous sequence or probe will compete for
and inhibit the binding (i.e., the hybridization) of a
completely homologous sequence or probe to the target
sequence under conditions of low stringency.

The term "antisense", as used herein, refers to
nucleotide sequences which are complementary to a specific
DNA or RNA sequence. The term "antisense strand" is used in
reference to a nucleic acid strand that is complementary to
the "sense" strand. Antisense molecules may be produced by
any method, including synthesis by ligating the gene(s) of
interest in a reverse orientation to a viral promoter which
permits the synthesis of a complementary strand. Once
introduced into a cell, this transcribed strand combines
with natural sequences produced by the cell to form
duplexes. These duplexes then block either the further
transcription or translation. In this manner, mutant
phenotypes may be generated. The designation "negative" is
sometimes used in reference to the antisense, and
"positive" is sometimes used in reference to the sense
strand.

"Amino acid sequence" as used herein refers to an
oligopeptide, peptide, polypeptide, or protein sequence,
and fragments or portions thereof, and to naturally
occurring or synthetic molecules. Where "amino acid
sequence" is recited herein this term excludes an amino
acid sequence of a naturally occurring protein. "Amino
acid sequence", "polypeptide" or "protein" are not meant to
limit the amino acid sequence to the complete, native amino
acid sequence associated with the recited protein molecule.

The term "functional fragments" or "fragments",
as used herein, with regard to a protein refers to portions
of that protein that are capable of exhibiting or carrying
out the activity exhibited by the protein as a whole. The
portions may range in size from three amino acid residues
to the entire amino acid sequence minus one amino acid.
For example, a protein "comprising at least a functional
fragment of the amino acid sequence of SEQ ID NO:1",
encompasses the full-length of the protein of SEQ ID NO:1
and portions thereof.



A "derivative" of a BAG protein, as used herein,
refers to an amino acid sequence that is altered by one or
more amino acids. The derivative may have "conservative"
changes, wherein a substituted amino acid has similar
structural or chemical properties, e.g., substitution of an
apolar amino acid with another apolar amino acid (such as
replacement of leucine with isoleucine). The derivative
may also have "nonconservative" changes, wherein a
substituted amino acid has different but sufficiently
similar structural or chemical properties that permit such
a substitution without adversely affecting the desired
biological activity, e.g., replacement of an amino acid
with an uncharged polar R group with an amino acid with an
apolar R group (such as replacement of glycine with
tryptophan), or alternatively replacement of an amino acid
with a charged R group with an amino acid with an uncharged
polar R group (such as replacement of lysine with
asparagine).

Amino Acids - Apolar R Groups

                                           Abbreviations
Amino Acid      Radical                    3-Letter   1-Letter
alanine         methyl                     ala        A
valine          2-propyl                   val        V
leucine         2-methylpropyl             leu        L
isoleucine      2-butyl                    ile        I
proline         propyl* - cyclized         pro        P
phenylalanine   benzyl                     phe        F
tryptophan      3-indolylmethyl            trp        W
methionine      methylthioethyl            met        M

Amino Acids - Uncharged Polar R Groups

                                           Abbreviations
Amino Acid      Radical                    3-Letter   1-Letter
glycine         H                          gly        G
serine          hydroxymethyl              ser        S
threonine       1-hydroxyethyl             thr        T
cysteine        thiolmethyl                cys        C
tyrosine        4-hydroxyphenylmethyl      tyr        Y
asparagine      aminocarbonylmethyl        asn        N
glutamine       aminocarbonylethyl         gln        Q

Amino Acids - Charged R Groups

                                           Abbreviations
Amino Acid      Radical                    3-Letter   1-Letter
aspartic acid   carboxymethyl              asp        D
glutamic acid   carboxyethyl               glu        E
lysine          4-aminobutyl               lys        K
arginine        3-guanylpropyl             arg        R
histidine       4-imidazoylmethyl          his        H

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Similar minor modifications may also include amino acids 
deletions or insertions or both. Guidance in determining 
which amino acid residues may be modified as indicated 
above without abolishing the desired biological 
5 functionality may be determined using computer programs 
well known in the art, for example, DNASTAR software. In 
addition, the derivative may also result from chemical 
modifications to the encoded polypeptide, including but not 
limited to the following, replacement of hydrogen by an 

10 alkyl, acyl, or amino group; es teri f ica tion of a carboxyl 
group with a suitable alkyl or aryl moiety; alkylation of 
a hydroxyl group to form an ether derivative. Further a 
derivative may also result from the substitution of a re- 
configuration amino acid with its corresponding D- 

15 configuration counterpart. 
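
The distinction between "conservative" and "nonconservative"
changes drawn above can be illustrated with a short sketch.
It assumes, purely for illustration, that a change is
conservative when both residues fall in the same R-group
table (apolar, uncharged polar, or charged) and
nonconservative otherwise; the function name
classify_substitution is hypothetical, and the sketch makes
no prediction about biological activity.

```python
# R-group classes transcribed from the three amino acid tables
# (1-letter abbreviations). The grouping rule below is an
# illustrative assumption, not a test of biological activity.
R_GROUP_TABLES = {
    "apolar": "AVLIPFWM",
    "uncharged polar": "GSTCYNQ",
    "charged": "DEKRH",
}

# Invert the tables into a residue -> class lookup.
R_GROUP_CLASS = {
    residue: group
    for group, residues in R_GROUP_TABLES.items()
    for residue in residues
}


def classify_substitution(original: str, replacement: str) -> str:
    """Label a single-residue change by comparing R-group classes."""
    original, replacement = original.upper(), replacement.upper()
    if original == replacement:
        return "identical"
    same_class = R_GROUP_CLASS[original] == R_GROUP_CLASS[replacement]
    return "conservative" if same_class else "nonconservative"


if __name__ == "__main__":
    # The three examples given in the definition of "derivative".
    for old, new in [("L", "I"), ("G", "W"), ("K", "N")]:
        print(old, "->", new, ":", classify_substitution(old, new))
```

The three pairs in the usage block are the examples named in
the text: leucine to isoleucine, glycine to tryptophan, and
lysine to asparagine.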

The term "mimetic", as used herein, refers to a
molecule, the structure of which is developed from
knowledge of the structure of a protein/polypeptide or
portions thereof (such as BAG-1) and, as such, is able to
effect some or all of the actions of BAG-1 protein.

"Peptide nucleic acid", as used herein, refers to
a molecule which comprises an oligomer to which an amino
acid residue, such as lysine, and an amino group have been
added. These small molecules, also designated anti-gene
agents, stop transcript elongation by binding to their
complementary strand of nucleic acid (Nielsen, P.E. et al.,
Anticancer Drug Des. 8:53-63 (1993)).

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a family of BAG-1
related proteins from humans [BAG-1L (SEQ ID NO:2), BAG-1S
beginning at residue 116 of SEQ ID NO:2, BAG-2 (SEQ ID
NO:4), BAG-3 (SEQ ID NO:6) and (SEQ ID NO:20), BAG-4 (SEQ
ID NO:8) and (SEQ ID NO:22) and BAG-5 (SEQ ID NO:10) and
(SEQ ID NO:24)], the invertebrate C. elegans [BAG-1 (SEQ ID
NO:12), BAG-2 (SEQ ID NO:14)] and the fission yeast S. pombe
[BAG-1A (SEQ ID NO:16), BAG-1B (SEQ ID NO:18)],
specifically the full length amino acid sequences
comprising human BAG-1L (SEQ ID NO:2), BAG-1 (beginning at
residue 116 of SEQ ID NO:2), and BAG-2 (SEQ ID NO:4); C.
elegans BAG-1 (SEQ ID NO:12) and BAG-2 (SEQ ID NO:14); and
S. pombe BAG-1A (SEQ ID NO:16) and BAG-1B (SEQ ID NO:18);
and partial sequences comprising human BAG-3 (SEQ ID NO:6)
and (SEQ ID NO:20), BAG-4 (SEQ ID NO:8) and (SEQ ID NO:22),
and BAG-5 (SEQ ID NO:10) and (SEQ ID NO:24) and functional
fragments thereof. In particular, the invention provides
the amino acid sequences comprising human BAG-2 (SEQ ID
NO:4), BAG-3 (SEQ ID NO:6) and (SEQ ID NO:20), BAG-4 (SEQ
ID NO:8) and (SEQ ID NO:22), and BAG-5 (SEQ ID NO:10) and
(SEQ ID NO:24) proteins.



Another aspect of the present invention provides
the nucleic acid molecules and nucleotide sequences that
encode the family of BAG-1 related proteins from humans
[BAG-1 (SEQ ID NO:1), BAG-2 (SEQ ID NO:3), BAG-3 (SEQ ID
NO:5) and (SEQ ID NO:19), BAG-4 (SEQ ID NO:7) and (SEQ ID
NO:21) and BAG-5 (SEQ ID NO:9) and (SEQ ID NO:23)], the
invertebrate C. elegans [BAG-1 (SEQ ID NO:11), BAG-2 (SEQ
ID NO:13)] and the fission yeast S. pombe [BAG-1A (SEQ ID
NO:15), BAG-1B (SEQ ID NO:17)].



BAG-1L (SEQ ID NO:2) is a multifunctional protein
that blocks apoptosis, promotes tumor cell metastasis, and
contributes to factor-independent and p53-resistant cell
growth. BAG-1L (SEQ ID NO:2) interacts with several types
of proteins, including Bcl-2, some tyrosine kinase growth
factor receptors, steroid hormone receptors, and the p53-
induced cell cycle regulator Siah-1A.



BAG-1 is a regulator of Hsc70/Hsp70 family
molecular chaperones. A carboxyl-terminal domain in this
protein binds tightly to the ATPase domains of Hsc70 and
Hsp70 (K_D = 1 nM) (Zeiner, M., Gebauer, M., and Gehring, U.,
EMBO J. 16: 5483-5490, (1997)). BAG-1 modulates the
activity of these molecular chaperones, acting as an
apparent functional antagonist of the Hsp70/Hsc70-
associated protein Hip (3-5) (Hohfeld, J. and Jentsch, S.,
EMBO J. 16: 6209-6216, (1997); Takayama, S., Bimston, D.
N., Matsuzawa, S., Freeman, B. C., Aime-Sempe, C., Xie, Z.,
Morimoto, R. I., and Reed, J. C., EMBO J. 16: 4887-96,
(1997); Zeiner, M., Gebauer, M., and Gehring, U., EMBO J.
16: 5483-5490, (1997)). In general, protein refolding is
accomplished by Hsp70/Hsc70 through repeated cycles of
target peptide binding and release, coupled to ATP
hydrolysis (Ellis, R., Curr Biol. 7: R531-R533, (1997)).
BAG-1 appears to promote substrate release, whereas Hip
stabilizes Hsp70/Hsc70 complex formation with target
peptides (Hohfeld, J., Minami, Y., and Hartl, F.-U., Cell.
83: 589-598, (1995)). Since each substrate interaction
with Hsc70/Hsp70 is unique in terms of the optimal length
of time the protein target should remain complexed with
Hsc70/Hsp70 for achieving new conformations, the net effect
of BAG-1 can be either enhancement or inhibition of the
refolding reaction.

The 70 kDa heat shock family proteins (Hsp70/Hsc70)
are essential to a variety of cellular processes and have
been implicated in cancer, yet it is unclear how these
proteins are regulated in vivo. A variety of co-chaperones
have been identified which may target Hsp70/Hsc70 to
different subcellular compartments or promote their
interactions with specific proteins or protein complexes.
BAG-1 appears to represent a novel Hsp70/Hsc70 regulator
which differs functionally from all other mammalian co-
chaperones identified to date, such as members of the
DnaJ-, Hip-, Hop-, and cyclophilin-families of proteins.

Another aspect of the present invention provides
the amino acid sequence of a binding domain of about 40 to
55 amino acids that binds the Hsc70/Hsp70 ATPase domain.
The BAG domain is situated near the C-terminus, and the
ubiquitin-like domains are situated near the N-terminus.

The BAG family of proteins of the present
invention contains a common conserved C-terminal domain (the
"BAG" domain) that facilitates binding to the ATPase domain
of Hsp70/Hsc70. The carboxyl-terminal domain of BAG-1
binds to the ATPase domain of Hsc70/Hsp70 and regulates its
chaperone function by acting as an ADP-ATP exchange factor.
Other domains of BAG-1 mediate interactions with proteins
such as Bcl-2 and retinoic acid receptors (RARs), allowing
BAG-1 to target Hsc70/Hsp70 to other proteins, presumably
modulating their function by changing their conformations.

Human BAG-1 was previously shown to inhibit
Hsc70/Hsp70 dependent refolding of denatured protein
substrates in vitro (S. Takayama, et al., EMBO J. 16, 4887-
96 (1997); M. Zeiner, M. Gebauer, U. Gehring, EMBO J. 16,
5483-5490 (1997); and J. Hohfeld, S. Jentsch, EMBO J. 16,
6209-6216 (1997)). In Example III, Part A the effects of
recombinant human BAG-1, BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ
ID NO:6) were compared using in vitro protein refolding
assays similar to those employed previously for assessing
BAG-1. The study showed that addition of equimolar amounts
of each of the recombinant proteins to Hsc70 resulted in
significant inhibition of luciferase refolding, with BAG-2
(SEQ ID NO:4) and BAG-3 (SEQ ID NO:6) showing somewhat
greater inhibitory activity than BAG-1 (Figure 4A). In a
separate luciferase folding study BAG-1, BAG-2 (SEQ ID
NO:4) and BAG-3 (SEQ ID NO:6) once again displayed
inhibition of luciferase refolding; however, in this study
varying amounts of BAG-1, BAG-2 (SEQ ID NO:4) and BAG-3
(SEQ ID NO:6) were added relative to Hsc70, which resulted
in concentration-dependent inhibition of Hsc70 chaperone
activity, i.e., luciferase folding (Example III Part A).
Additional follow-on studies using the same experimental
protocols as the previous studies, as taught in Example
IIA, have shown that BAG-4 (SEQ ID NO:22) also undergoes
association with Hsc70/ATPase.



Yet another aspect of the present invention
provides a nucleotide sequence having at least about 15
nucleotides and, generally, about 25 nucleotides,
preferably about 35 nucleotides, more preferably about 45
nucleotides, and most preferably about 55 nucleotides that
can hybridize under relatively stringent conditions to, or
is complementary to, a portion of the nucleic acid
sequences shown in Figures 1-9 and Figures 15-17, in
particular the BAG domain as shown in Figure 1B, e.g.,
nucleotides 552-593 of human BAG-3, or nucleotides 167-221
of human BAG-4.



Yet another aspect of the present invention
provides a compound of the formula,

R^{N}-R^{1}X^{1}R^{2}X^{2}R^{3}X^{3}R^{4}X^{4}R^{5}X^{5}R^{6}X^{6}R^{7}X^{7}X^{8}R^{9}X^{9}R^{10}X^{10}R^{11}X^{11}-R^{C}

wherein,

R^{N} is a group of 1 to 552 independently selected
amino acids;

R^{1} is a group of 3 independently selected amino
acids;

X^{1} is an amino acid with a charged or uncharged
R group, such as aspartic acid, glutamic acid, asparagine,
or glutamine;

R^{2} is a group of 7 independently selected amino
acids;

X^{2} is an amino acid with a charged R group, such
as glutamic acid;

R^{3} is a group of 5 independently selected amino
acids;

X^{3} is an amino acid with an apolar R group, such
as leucine, methionine, or isoleucine;

R^{4} is a group of 3 independently selected amino
acids;

X^{4} is an amino acid with a charged R group, such as
aspartic acid or glutamic acid;

R^{5} is a single independently selected amino acid;

X^{5} is an amino acid with an apolar or uncharged R
group, such as leucine, valine, methionine, alanine or
threonine;

R^{6} is a group of 15 independently selected amino
acids;

X^{6} is an amino acid with a charged or uncharged
R group, such as arginine, lysine, glutamine or aspartic
acid;

R^{7} is a group of 2 independently selected amino
acids;

X^{7} is an amino acid with a charged R group, such
as arginine;

X^{8} is an amino acid with a charged R group, such
as arginine or lysine;

R^{9} is a group of 2 independently selected amino
acids;

X^{9} is an amino acid with an apolar R group, such
as valine;

R^{10} is a group of 3 independently selected amino
acids;

X^{10} is an amino acid with an uncharged R group,
such as glutamine;

R^{11} is a group of 2 independently selected amino
acids;

X^{11} is an amino acid with an apolar R group, such
as leucine; and

R^{C} is a group of 1 to 100 independently selected
amino acids.
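
The formula above can be read as a fixed-spacing sequence
pattern. The following is a minimal sketch of that reading,
under stated assumptions: each X position is mapped to the
full R-group class named in the formula (apolar, uncharged
polar, charged, using the 1-letter codes from the amino acid
tables), each R group is treated as that many unconstrained
standard residues, and the names CORE, core_length,
example_core and matches_bag_formula are hypothetical.
Whether a matching sequence actually binds Hsc70/Hsp70 is
not something this check can establish.

```python
import re

APOLAR = "AVLIPFWM"
UNCHARGED = "GSTCYNQ"
CHARGED = "DEKRH"
ANY = APOLAR + UNCHARGED + CHARGED  # the 20 standard residues

# ("R", n): n unconstrained residues; ("X", letters): one residue
# drawn from the stated R-group class(es). Order follows the formula
# R1 X1 R2 X2 R3 X3 R4 X4 R5 X5 R6 X6 R7 X7 X8 R9 X9 R10 X10 R11 X11.
CORE = [
    ("R", 3), ("X", CHARGED + UNCHARGED),
    ("R", 7), ("X", CHARGED),
    ("R", 5), ("X", APOLAR),
    ("R", 3), ("X", CHARGED),
    ("R", 1), ("X", APOLAR + UNCHARGED),
    ("R", 15), ("X", CHARGED + UNCHARGED),
    ("R", 2), ("X", CHARGED), ("X", CHARGED),
    ("R", 2), ("X", APOLAR),
    ("R", 3), ("X", UNCHARGED),
    ("R", 2), ("X", APOLAR),
]


def core_length() -> int:
    """Number of residues spanned by R1 through X11."""
    return sum(value if kind == "R" else 1 for kind, value in CORE)


def example_core() -> str:
    """One arbitrary sequence satisfying the core constraints."""
    return "".join(
        "A" * value if kind == "R" else value[0] for kind, value in CORE
    )


def _build_pattern() -> re.Pattern:
    parts = [
        f"[{ANY}]{{{value}}}" if kind == "R" else f"[{value}]"
        for kind, value in CORE
    ]
    # R^N is 1 to 552 residues, R^C is 1 to 100 residues.
    return re.compile(
        f"[{ANY}]{{1,552}}" + "".join(parts) + f"[{ANY}]{{1,100}}"
    )


_PATTERN = _build_pattern()


def matches_bag_formula(sequence: str) -> bool:
    """True if the whole sequence fits R^N-(core)-R^C as sketched."""
    return _PATTERN.fullmatch(sequence.upper()) is not None


if __name__ == "__main__":
    candidate = "M" + example_core() + "G"
    print(core_length(), matches_bag_formula(candidate))
```

Summing the fixed positions gives a core of 54 residues,
which is in line with the binding domain of about 40 to 55
amino acids described earlier.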

A nucleotide sequence of at least about 15
nucleotides and, generally, about 25 nucleotides,
preferably about 35 nucleotides, more preferably about 45
nucleotides, and most preferably about 55 nucleotides can
be useful, for example, as a primer for the polymerase
chain reaction (PCR) or other similar reaction mediated by
a polymerase such as a DNA or RNA polymerase (see PCR
Protocols: A guide to methods and applications, ed. Innis
et al. (Academic Press, Inc., 1990), which is incorporated
herein by reference; see, for example, pages 40-41). In
addition, such a nucleotide sequence of the invention can
be useful as a probe in a hybridization reaction such as
Southern or northern blot analysis or in a binding assay
such as a gel shift assay.
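
A probe or primer hybridizes to the strand complementary to
its own sequence, so designing one usually starts from the
reverse complement of the target. The sketch below shows
that step together with a check against the "at least about
15 nucleotides" lower bound given above; the function names
and the example sequence are hypothetical and are not taken
from the disclosed Figures.

```python
# Watson-Crick pairing for DNA; case is preserved.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

MIN_PROBE_LENGTH = 15  # "at least about 15 nucleotides"


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return dna.translate(_COMPLEMENT)[::-1]


def is_long_enough(dna: str, minimum: int = MIN_PROBE_LENGTH) -> bool:
    """Check a candidate probe against the stated minimum length."""
    return len(dna) >= minimum


if __name__ == "__main__":
    target = "ATGGCGAAGCTGACCTTGGA"  # made-up 20-mer, illustration only
    probe = reverse_complement(target)
    print(probe, is_long_enough(probe))
```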



A nucleotide sequence of the invention can be
particularly useful as an antisense molecule, which can be
DNA or RNA and can be targeted to all or a portion of the
5'-untranslated region or of the 5'-translated region of a
bag-1 nucleic acid sequence in a cell. For example, an
antisense molecule can be directed to at least a portion of
the sequence shown as the BAG domain in Figure 1A, e.g.,
nucleotides 272-319 of human BAG-1L (SEQ ID NO:1), or
nucleotides 79-147 of human BAG-5 (SEQ ID NO:9). Since the
5'-region of a nucleic acid contains elements involved in
the control of expression of an encoded protein, an
antisense molecule directed to the 5'-region of a nucleic
acid molecule can affect the levels of protein expressed in
a cell.

WO 00/14106 PCT/US99/21053



A nucleotide sequence of the invention also can
be useful as a probe to identify a genetic defect due to a
mutation of a gene encoding a BAG protein in a cell. Such
a genetic defect can lead to aberrant expression of a BAG
protein in the cell or to expression of an aberrant BAG
protein, which does not properly associate with a Bcl-2-
related protein or Hsc70/Hsp70 protein in the cell. As a
result, a genetic defect in a gene encoding, for example,
human BAG-1 can result in a pathology characterized by
increased or decreased levels of protein folding.



Further, a nucleotide compound or composition as
taught in the present invention can be synthesized using
routine methods or can be purchased from a commercial
source. In addition, a population of such nucleotide
sequences can be prepared by restriction endonuclease or
mild DNAse digestion of a nucleic acid molecule that
contains nucleotides as shown in the nucleotide sequences
of Figures 1-9 and Figures 15-17 that encode the
amino acid sequences also shown in Figures 1-9 and
Figures 15-17. Methods for preparing and using such
nucleotide sequences, for example, as hybridization probes
to screen a library for homologous nucleic acid molecules
are well known in the art (see, for example, Sambrook et
al., Molecular Cloning: A laboratory manual (Cold Spring
Harbor Laboratory Press 1989); Ausubel et al., Current
Protocols in Molecular Biology (Green Publ., NY 1989),
each of which is incorporated herein by reference).



A particular nucleotide sequence can be designed
based, for example, on a comparison of the nucleic acid
molecules encoding any one of the BAG family proteins, as
shown in Figures 1-9 and Figures 15-17, with another in the
family. Such a comparison allows, for example, the
preparation of a nucleotide sequence that will hybridize to
a conserved region present in both nucleic acid molecules,
thus providing a means to identify homologous nucleic acid
molecules present in other cell types or other organisms.
In addition, such a comparison allows the preparation of a
nucleotide sequence that will hybridize to a unique region
of any of the BAG family nucleotide sequences, such as
those corresponding to the BAG domain, thus allowing
identification of other proteins sharing this motif. In
this regard, it is recognized that, while the human BAG-3
proteins shown as Figures 3 and 20, and human BAG-5
proteins shown as Figures 5 and 24, are only partial
sequences, a variant human BAG-3 or BAG-5 produced, for
example, by alternative splicing can exist and can be
identified using an appropriately designed nucleotide
sequence of the invention as a probe. Such useful probes
readily can be identified by inspection of the sequences
shown in the disclosed Figures and by a comparison of the
encoding nucleotide sequences.

If desired, a nucleotide sequence of the
invention can incorporate a detectable moiety such as a
radiolabel, a fluorochrome, a ferromagnetic substance, a
luminescent tag or a detectable binding agent such as
biotin. These and other detectable moieties and methods of
incorporating such moieties into a nucleotide sequence are
well known in the art and are commercially available. A
population of labelled nucleotide sequences can be
prepared, for example, by nick translation of a nucleic
acid molecule of the invention (Sambrook et al., supra,
1989; Ausubel et al., supra, 1989).

One skilled in the art would know that a method
involving hybridization of a nucleotide sequence of the
invention can require that hybridization be performed under
relatively stringent conditions such that nonspecific
background hybridization is minimized. Such hybridization
conditions can be determined empirically or can be
estimated based, for example, on the relative GC content of
a sequence and the number of mismatches, if known, between
the probe and the target sequence (see, for example,
Sambrook et al., supra, 1989).
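
As an illustration of estimating hybridization conditions
from GC content and mismatches, the sketch below applies a
commonly cited empirical approximation for the melting
temperature of a DNA duplex, with roughly a 1 degree C
reduction per 1% mismatch. The specific constants, the
default salt concentration, and the function names are
assumptions chosen for illustration; they are not taken
from this document, and any real protocol should be set
empirically as noted above.

```python
import math


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = dna.upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


def estimate_tm(dna: str, na_molar: float = 0.15,
                percent_mismatch: float = 0.0) -> float:
    """Rough duplex melting temperature in degrees C.

    Uses the empirical form
        Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 675/N
    and subtracts about 1 degree per 1% mismatch. All constants
    are textbook approximations, not values from this document.
    """
    n = len(dna)
    percent_gc = 100.0 * gc_fraction(dna)
    tm = 81.5 + 16.6 * math.log10(na_molar) + 0.41 * percent_gc - 675.0 / n
    return tm - percent_mismatch


if __name__ == "__main__":
    probe = "ATGGCGAAGCTGACCTTGGAGCTGA"  # made-up 25-mer
    print(round(estimate_tm(probe), 1))
    print(round(estimate_tm(probe, percent_mismatch=4.0), 1))
```

A wash temperature set a fixed margin below the estimated
value is one common starting point before empirical
adjustment.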



The invention further provides antibodies
specific for human BAG family protein. As used herein, the
term "antibody" includes polyclonal and monoclonal
antibodies, as well as polypeptide fragments of antibodies
that retain a specific binding activity for human BAG-1 of
at least about 1 x 10^5 M^-1. One skilled in the art would
know that anti-BAG-1 antibody fragments such as Fab, F(ab')2
and Fv fragments can retain specific binding activity for
human BAG-1 (beginning at residue 116 of SEQ ID NO:2) and,
thus, are included within the definition of an antibody.
In addition, the term "antibody" as used herein includes
naturally occurring antibodies as well as non-naturally
occurring antibodies and fragments that retain binding
activity such as chimeric antibodies or humanized
antibodies. Such non-naturally occurring antibodies can be
constructed using solid phase peptide synthesis, can be
produced recombinantly or can be obtained, for example, by
screening combinatorial libraries consisting of variable
heavy chains and variable light chains as described by Huse
et al., Science 246:1275-1281 (1989), which is incorporated
herein by reference.

One skilled in the art would know that purified
BAG family protein, which can be prepared from natural
sources or synthesized chemically or produced
recombinantly, or portions of a BAG family protein,
including a portion of human BAG family protein such as a
synthetic peptide as described above, can be used as an
immunogen. Such peptides useful for raising an antibody
include, for example, peptide portions of the N-terminal 85
amino acids or the BAG domain of any of the human BAG
proteins (see Figure 1B). A particularly advantageous use
of such a protein is for immunostaining, wherein the
method provides a process to contrast the immunostaining
of BAG-family proteins in carcinoma cells with adjacent
non-neoplastic prostatic epithelial and basal cells which
are generally present in the same tissue sections. These
results would be correlated with a Gleason grade to
determine whether any of the BAG-family proteins tend to be
expressed at higher or lower levels in histologically
advanced tumors. From this process a determination can be
made as to the degree to which the disease is progressing
in a given patient, i.e., a prognosis can be made.

Non-immunogenic fragments or synthetic peptides
of BAG proteins can be made immunogenic by coupling the
hapten to a carrier molecule such as bovine serum albumin
(BSA) or keyhole limpet hemocyanin (KLH), as described in
Example IV, below. In addition, various other carrier
molecules and methods for coupling a hapten to a carrier
molecule are well known in the art and described, for
example, by Harlow and Lane, Antibodies: A laboratory
manual (Cold Spring Harbor Laboratory Press, 1988), which
is incorporated herein by reference.

EXAMPLES 



The following examples are given to enable those
skilled in the art to more clearly understand and to
practice the present invention. They should not be
considered as limiting the scope of the invention, but
merely as being illustrative and representative thereof.

EXAMPLE I

Isolation and Characterization
of BAG-family cDNA Sequences

This example describes methods for isolating and
characterizing BAG-family cDNA sequences from human,
nematode and yeast.

A. Cloning of human BAG cDNA sequences

Yeast two-hybrid library screening of a human
Jurkat cell cDNA library was performed as described
(Takayama et al., EMBO J., 16:4887-96 (1997); Matsuzawa et
al., EMBO J., 17:2736-2747 (1998), which are incorporated
herein by reference) using EGY48 strain yeast transformed
with pGilda-Hsc70/ATPase (67-377 amino acids) and the lacZ
reporter plasmid pSH18-34. Of the resulting ~5 x 10^6
transformants, 112 Leu+ colonies were obtained after
1 week incubation at 30°C. Assay of β-galactosidase (β-gal)
activity of these colonies resulted in 96 clones. Mating
tests were then performed using RFY206 yeast strain
transformed with pGilda, pGilda-mBAG-1 (1-219), or pGilda-
Hsc70/ATPase. Of these, 66 displayed specific interactions
with Hsc70/ATPase. The pJG4-5 cDNAs were recovered using
KC8 E. coli strain, which is auxotrophic for tryptophan
(Trp). DNA sequencing revealed 3 partially overlapping
human BAG-1 cDNAs, 4 identical and one overlapping cDNAs
encoding BAG-2, and 2 partially overlapping BAG-3 clones.
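
The screen above narrows the library in successive steps.
The short sketch below simply tabulates the counts reported
in the text and the fraction retained at each step; the
step labels and the function name are illustrative, and no
numbers beyond those stated above are introduced.

```python
# Counts as reported in the text for the two-hybrid screen.
SCREEN_STEPS = [
    ("transformants screened", 5_000_000),   # ~5 x 10^6
    ("Leu+ colonies", 112),
    ("beta-gal positive clones", 96),
    ("specific for Hsc70/ATPase", 66),
]


def retention_table(steps):
    """Return (label, count, fraction retained from previous step)."""
    rows = []
    previous = None
    for label, count in steps:
        fraction = None if previous is None else count / previous
        rows.append((label, count, fraction))
        previous = count
    return rows


if __name__ == "__main__":
    for label, count, fraction in retention_table(SCREEN_STEPS):
        shown = "" if fraction is None else f"{fraction:.2e}"
        print(f"{label:28s} {count:>9d} {shown}")
```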

Using the above described yeast two-hybrid screen
with the ATPase domain of Hsc70 as "bait", several human
cDNAs were cloned which encode portions of BAG-1 or of two
other BAG-1-like proteins which are termed BAG-2 (SEQ ID
NO:4) and BAG-3 (SEQ ID NO:6). The longest of the cDNAs
for BAG-2 (SEQ ID NO:3) and BAG-3 (SEQ ID NO:5) contained
open reading frames (ORFs) of 207 and 162 amino acids,
respectively, followed by stop codons. All BAG-1 (SEQ ID
NO:1), BAG-2 (SEQ ID NO:3) and BAG-3 (SEQ ID NO:5) cDNAs
obtained by two-hybrid library screening with Hsc70/ATPase
contained a conserved domain of about 40-50 amino acids
which is termed the "BAG" domain and is shown in Figure
10. These results demonstrate that a family of BAG-1-
related proteins all contain a conserved ~45 amino acid
region near their C-terminus that binds Hsc70/Hsp70.

B. Identification of additional BAG-family proteins

A search of the translated Genbank database using
the bBLAST and FASTA search programs also identified human
ESTs that provided sequences for further investigation of
BAG-family proteins. The putative BAG-4 (SEQ ID NO:8) and
BAG-5 (SEQ ID NO:10) proteins contain BAG-domains that
share the greatest sequence similarity with the BAG-domain
of BAG-3 (SEQ ID NO:6). These were designated BAG-4
(Accession number AA693697, N74588) and BAG-5 (Accession
number AA456862, N34101). BAG-4 has 62% identity and ~81%
similarity to BAG-3, and BAG-5 has 51% identity and ~75%
similarity to BAG-3.
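
Percent identity and percent similarity figures like those
above are computed over aligned positions. The sketch below
shows the arithmetic on two pre-aligned sequences. It
assumes, for illustration only, that two different residues
count as "similar" when they share an R-group class from the
amino acid tables earlier in this document; the search
programs named above use substitution scoring matrices
instead, so the values need not agree with theirs. The
function name and the toy sequences are hypothetical.

```python
# R-group classes (1-letter codes) from the amino acid tables.
_CLASSES = {"apolar": "AVLIPFWM", "uncharged": "GSTCYNQ", "charged": "DEKRH"}
R_GROUP_CLASS = {aa: name for name, group in _CLASSES.items() for aa in group}


def identity_and_similarity(aligned_a: str, aligned_b: str):
    """Percent identity and similarity over gap-free aligned columns.

    Both inputs must be the same length; '-' marks a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a != "-" and b != "-"
    ]
    if not columns:
        raise ValueError("no gap-free columns to compare")
    identical = sum(a == b for a, b in columns)
    similar = sum(
        a == b
        or (a in R_GROUP_CLASS and R_GROUP_CLASS.get(a) == R_GROUP_CLASS.get(b))
        for a, b in columns
    )
    total = len(columns)
    return 100.0 * identical / total, 100.0 * similar / total


if __name__ == "__main__":
    # Toy aligned fragments, not taken from any BAG sequence.
    print(identity_and_similarity("LKDE-A", "IKNEGA"))
```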

Additional BAG-family orthologues or homologues
were also identified using computer-based searches, which
revealed BAG-family homologues in the nematode C. elegans
and the fission yeast S. pombe. The C. elegans genome
encodes two apparent BAG-family proteins, which are most
similar in their overall sequences to the human BAG-1
(AF039713, gi:2773211) (SEQ ID NO:12) and BAG-2 (SEQ ID
NO:14) (AF068719, gi:3168927). S. pombe contains two
BAG-family proteins that share the greatest overall
sequence similarity with human BAG-1 (AL023554, gi/3133105
and AL023634, gi/3150250). The human and C. elegans BAG-1
proteins as well as S. pombe BAG-1A all have ubiquitin-like
domains of unknown function near their N-termini (see
Figure 10A).

The overall predicted amino acid sequences of the
C. elegans BAG-1 (SEQ ID NO:12) and S. pombe BAG-1A (SEQ ID
NO:16) proteins are ~18% identical (~61% similar) and ~17%
identical (~64% similar), respectively, to human BAG-1,
implying origin from a common ancestral gene. The C.
elegans BAG-1 protein (SEQ ID NO:12), however, contains a
5 to 7 amino acid insert in its BAG-domain as compared to
the human, murine, and yeast BAG-1 homologues (see Figure
10B), and is more similar to BAG-2 (SEQ ID NO:4) in regard
to its BAG-domain. C. elegans and human BAG-2 also may be
derived from a common ancestor, as the C-terminal 225 amino
acid regions of C. elegans and human BAG-2, which encompass
both the BAG domain and the upstream region, share
~34% amino acid sequence identity and ~70% similarity. The
human BAG-2 protein (SEQ ID NO:4), however, contains a 9
amino acid insert in its BAG-domain compared to its
C. elegans counterpart (see Figure 10B). Evolutionary-tree
prediction algorithms suggest that human and C. elegans
BAG-2 represent a distinct branch of the BAG-family that is
more evolutionarily distant from the other BAG-family
proteins. None of the predicted BAG-family proteins
contain recognizable regions analogous to those found in
other Hsc70 regulatory proteins, such as the J-domains and
G/F-domains of DnaJ family proteins and the
Tetratricopeptide Repeat (TPR) domains of Hip/Hop family
proteins.

C. Yeast two-hybrid assay of BAG binding to Hsc70/ATPase

The longest of the cDNAs obtained for the BAG-2
and BAG-3 proteins were expressed with N-terminal
transactivation (TA) domains in yeast and tested by yeast
two-hybrid assay for interactions with fusion proteins
consisting of Hsp70/ATPase or a variety of unrelated
proteins (Fas, Siah, Fadd) containing N-terminal LexA DNA-
binding domains. TA-BAG-2 and TA-BAG-3 demonstrated
positive interactions with LexA-Hsc70/ATPase, resulting in
transactivation of a lacZ reporter gene that was under the
control of LexA operators (Figure 11A). No interactions
with LexA-Fas (cytosolic domain), LexA-Siah, LexA-Fadd, or
LexA were detected (see Figure 11A), demonstrating that the
BAG-2 and BAG-3 proteins interact specifically with
Hsc70/ATPase. Specific two-hybrid interactions between
Hsc70/ATPase and either BAG-2 or BAG-3 were also observed
when BAG-2 and BAG-3 were expressed as LexA DNA-binding
domain fusion proteins and Hsc70/ATPase was fused with a TA
domain (see Figure 11A; right panel). These results
demonstrate that, similarly to BAG-1, BAG-2 and BAG-3
specifically interact with Hsc70/ATPase.

In order to determine whether the BAG proteins
are capable of forming heterodimers, coexpression of BAG-2
and BAG-3 in the yeast two-hybrid assay was also performed.
Coexpression of BAG-2 and BAG-3 failed to show interaction
with BAG-1 or a deletion mutant of BAG-1 (ΔC), which is
missing part of its C-terminal domain required for
Hsp70/Hsc70 binding, suggesting that these proteins do not
form heterodimers.

D. Isolation and characterization of the complete open
reading frame sequences of BAG-2 and BAG-3

In order to deduce the complete ORFs of BAG-2 and
BAG-3, a λ-phage cDNA library was screened as follows,
using hybridization probes derived from the two-hybrid
screening. A human Jurkat T-cell λ-ZapII cDNA
library (Stratagene) was screened by hybridization using
32P-labeled purified insert DNA from the longest of the
human BAG-2 (clone #11) and human BAG-3 (clone #28) cDNA
clones. From about one million clones screened, 38 BAG-2
and 23 BAG-3 clones were identified, cloned, and their cDNA
inserts recovered as pSKII plasmids using a helper phage
method (Stratagene). DNA sequencing of λ-phage derived
human BAG-2 cDNA clones revealed an ORF encoding a
predicted 211 amino acid protein, preceded by an in-frame
stop codon. The longest human BAG-3 λ-phage cDNA clone
contains a continuous ORF of 682 amino acids followed by a
stop codon, but without an identifiable start codon (see
Figure 10A).



Although BAG-1L (SEQ ID NO:2), BAG-1 (beginning
at residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID NO:4), and
BAG-3 (SEQ ID NO:6) all contain a homologous BAG domain
near their C-terminus, the N-terminal regions of these
proteins are dissimilar. Using a combination of search
tools (Prosite Search: PP search, using the Prosite pattern
database, BCM Search Launcher, Baylor College of Medicine,
and Blocks Search), it was determined that the BAG-2 N-
terminal region contains potential kinase phosphorylation
sites but otherwise shares no apparent similarity with
other proteins or known functional domains.



In contrast, the predicted N-terminal region of
BAG-3 contains a WW domain as shown in Figure 10A. WW
domains have been identified in a wide variety of signaling
proteins, including a Yes kinase adaptor protein (YAP), the
Na+-channel regulator Nedd4, formin-binding proteins,
dystrophin, and the peptidyl prolyl cis-trans-isomerase
Pin-1. These roughly 40 amino acid domains mediate protein
interactions and bind the preferred peptide ligand sequence
xPPxY (Sudol, TIBS 21: 161-163, 1996, which is
incorporated herein by reference).
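
The xPPxY ligand sequence named above is a short linear
pattern, so candidate WW-domain binding sites in a protein
sequence can be located with a simple pattern search. The
sketch below treats "x" as any single residue letter; the
function name and the example peptide are hypothetical, and
a match indicates only that the pattern is present, not that
binding occurs.

```python
import re

# "x" = any residue; a lookahead lets overlapping sites be reported.
_XPPXY = re.compile(r"(?=([A-Z]PP[A-Z]Y))")


def find_xppxy_sites(protein: str):
    """Return (1-based start position, matched motif) for each site."""
    return [
        (match.start() + 1, match.group(1))
        for match in _XPPXY.finditer(protein.upper())
    ]


if __name__ == "__main__":
    peptide = "MAPPGYSSLPPAYQ"  # made-up peptide, illustration only
    print(find_xppxy_sites(peptide))
```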
EXAMPLE II

In vitro Association of
BAG proteins and Hsc70/ATPase

This example demonstrates that BAG-2 (SEQ ID
NO:4) and BAG-3 (SEQ ID NO:6) bind Hsc70/ATPase in various
in vitro assays.

A. Solution binding assay of BAG-2 and BAG-3 to
Hsc70/ATPase

Association of BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ
ID NO:6) with Hsc70/ATPase was determined by an in vitro
protein binding assay where Hsc70/ATPase or BAG-family
proteins were expressed in bacteria as Glutathione S-
Transferase (GST) fusion proteins. Purified cDNA sequences
encoding residues 5 to 211 of human BAG-2 (clone #11) and
the C-terminal 135 amino acids of human BAG-3 (clone #28)
(see Figure 10A) were subcloned into the EcoRI/XhoI sites
of pGEX-4T-1 prokaryotic expression plasmid (Pharmacia;
Piscataway, NJ). These plasmids as well as pGEX-4T-1-BAG-1,
pGEX-4T-1-BAG-1 (ΔC), and pGEX-4T-1-XL, which have been
described previously (Takayama et al., supra (1997); Xie et
al., Biochemistry, 37:6410-6418, (1998), which are
incorporated herein by reference), were expressed in XL-1
blue strain E. coli (Stratagene, Inc., La Jolla, CA).
Briefly, a single colony was inoculated into 1 L of LB media
containing 50 µg/ml ampicillin and grown at 37°C overnight.
The culture was then diluted by half with fresh
LB/ampicillin and cooled to room temperature for 1 hr,
before inducing with 0.4 mM IPTG for 6 h at 25°C.
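
The induction step above involves a routine dilution
calculation (C1 x V1 = C2 x V2). The sketch below works it
through under two assumptions that are not stated in the
text: that "diluted by half" means adding an equal volume of
fresh medium, giving 2 L in total, and that IPTG is added
from a 1 M stock. The function name is hypothetical.

```python
def stock_volume_ml(final_conc_mM: float, final_volume_ml: float,
                    stock_conc_mM: float) -> float:
    """Volume of stock (ml) needed to reach a final concentration.

    From C1*V1 = C2*V2, ignoring the small volume added by the stock.
    """
    return final_conc_mM * final_volume_ml / stock_conc_mM


if __name__ == "__main__":
    overnight_culture_ml = 1000.0               # 1 L, as stated in the text
    induced_volume_ml = overnight_culture_ml * 2  # assumption: 1:1 dilution
    iptg_stock_mM = 1000.0                      # assumption: 1 M stock
    volume = stock_volume_ml(0.4, induced_volume_ml, iptg_stock_mM)
    print(f"Add about {volume:.2f} ml of IPTG stock")
```

With a different stock concentration or culture volume, only
the arguments change.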

Cells were recovered and incubated with 0.5 mg/ml
lysozyme in 50 mM Tris (pH 8.0), 150 mM NaCl, 1% Tween-20,
0.1% 2-mercaptoethanol, 5 mM EDTA, 1 mM PMSF and a mixture
of other protease inhibitors obtained from Boehringer
Mannheim (1697498) at room temperature for 0.5 h, followed
by sonication. Cellular debris was pelleted by
centrifugation at 27,500 g for 10 min and the resulting
supernatants were incubated with 30 ml of glutathione-
Sepharose (Pharmacia) at 4°C overnight. The resin was then
washed with 20 mM Tris (pH 8.0), 150 mM NaCl, 0.1% Tween-
20, and 0.1% 2-mercaptoethanol until the OD at 280 nm
reached <0.01. For removal of GST, the resin with
immobilized GST-fusion protein was incubated with 10 U of
thrombin (Boehringer, Inc.) at 4°C in 20 mM Tris (pH 8.0),
150 mM NaCl, 0.1% Tween-20, 0.1% 2-mercaptoethanol, and
2.5 mM CaCl2 overnight. Released proteins were then
purified on Mono Q (HR10/10, Pharmacia) by FPLC using a
linear gradient of 0.5 M NaCl at pH 8.0 and dialyzed into
chaperone assay buffer.



The ability of BAG-2 (SEQ ID NO:4) or BAG-3 (SEQ
ID NO:6) to bind Hsc70/ATPase in solution was then
examined. GST control or GST-BAG proteins were immobilized
on glutathione-Sepharose and tested for binding to
35S-labeled in vitro translated (IVT) proteins.
Immunoprecipitation and in vitro GST-protein binding assays
were performed as described by Takayama et al., supra
(1997), using pCI-Neo flag or pcDNA3-HA into which human
BAG-2 (clone #11) or human BAG-3 (clone #28) had been
subcloned for in vitro translation of 35S-L-methionine
labeled proteins or expression in 293T cells. As shown in
Figure 11B, 35S-Hsc70/ATPase bound in vitro to GST-BAG-1,
GST-BAG-2, and GST-BAG-3 but not to GST-BAG-1(ΔC) or
several other control proteins. BAG-1 (beginning at
residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID NO:4), and BAG-3
(SEQ ID NO:6) also exhibited little or no binding to
themselves or to each other, demonstrating that these
proteins do not strongly homo- or hetero-dimerize or
oligomerize. It should be noted, however, that BAG-2 (SEQ




ID NO:4) displayed weak interactions with itself in binding
assays and produced a positive result in yeast two-hybrid
experiments, demonstrating that it can have the ability to
self-associate.

B. Binding of BAG proteins to Hsc70 in vivo

The ability of BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ
ID NO:6) proteins to interact in cells with Hsc70 was
tested by expressing these proteins with N-terminal Flag
epitope tags in 293T human epithelial cells using
co-immunoprecipitation assays as described previously
(Takayama et al., supra (1997)). cDNAs encoding the
λ-phage cloned regions of BAG-2 and BAG-3 were subcloned
in-frame into pcDNA3-Flag. Anti-Flag immune complexes
prepared from 293T cells after transfection with plasmids
encoding Flag-BAG-1, Flag-BAG-2, or Flag-BAG-3 were
analyzed by SDS-PAGE/immunoblot assay. As shown in Figure
10C, antiserum specific to Hsc70 detected the presence of
BAG proteins associated with Hsc70, whereas control immune
complexes prepared with IgG1 as well as anti-Flag immune
complexes prepared from cells transfected with Flag-tagged
control proteins, Daxx and Apaf-1, did not contain Hsc70
associated protein. These results further demonstrate that
BAG-family proteins specifically bind to Hsc70.

C. BIAcore assay of BAG protein binding to the ATPase
domain of Hsc70

BAG-1 (beginning at residue 116 of SEQ ID NO:2)
is known to bind tightly to the ATPase domain of Hsc70
(Stuart et al., J. Biol. Chem., In Press (1998)). BAG-2
(SEQ ID NO:4) and BAG-3 (SEQ ID NO:6) proteins were,
therefore, examined for their ability to bind to
Hsc70/ATPase. The affinity and binding kinetics of BAG-2
(SEQ ID NO:4) and BAG-3 (SEQ ID NO:6) for Hsc70/ATPase were
also compared to those of BAG-1 (beginning at residue 116 of
SEQ ID NO:2) for Hsc70/ATPase, using a surface plasmon
resonance technique (BIAcore) which has been described
previously (Stuart et al., supra (1998), which is
incorporated herein by reference).



BAG-family proteins were produced in bacteria and
purified to near homogeneity as shown in Figure 12A and
described above in Example I. The purified BAG-1
(beginning at residue 116 of SEQ ID NO:2), -2 (SEQ ID
NO:4), and -3 (SEQ ID NO:6) proteins were then immobilized
on biosensor chips and tested for their interactions with
Hsc70 in the soluble phase. Kinetic measurements were
performed using a BIAcore-II instrument with CM5 sensor
chip and Amine Coupling Kit (Pharmacia Biosensor AB,
Sweden). Briefly, for immobilization of proteins, the

sensor chip was equilibrated with HK buffer (10 mM Hepes
(pH 7.4), 150 mM KCl) at 5 µl/min, then activated by
injecting 17 µl of 0.2 M N-ethyl-N'-(3-diethylaminopropyl)-
carbodiimide and 0.05 M N-hydroxysuccinimide (NHS/EDC)
followed by 35 µl of the protein of interest, in 10 mM
acetate, pH 3.5-4.5. Excess NHS-ester on the surface was
deactivated with 17 µl 1 M ethanolamine-HCl (pH 8.5). After
immobilization, 5 µl of regeneration buffer (50 mM phosphate
(pH 6.8) and 4 M GuHCl) was injected. For binding assays,
Hsp70 (Sigma, H8778) was dissolved in HK buffer, and
injected at 10 µl/min across the prepared surface at
various concentrations. The surface was regenerated after
each injection with 5 µl of regeneration buffer. The rate
constants k_ass and k_diss were generated with BIAevaluation
software 3.01 (Pharmacia Biosensor AB). Addition of Hsc70
to chips containing BAG-1 (beginning at residue 116 of SEQ
ID NO:2), BAG-2 (SEQ ID NO:4) or BAG-3 (SEQ ID NO:6)
resulted in concentration-dependent binding, as reflected
by an increase in the Response Units (RU) measured at the
chip surface (shown in Figure 3B). In contrast, Hsc70
failed to display interactions in BIAcore assays with a
variety of control proteins as well as a mutant of BAG-1
lacking a C-terminal portion of the BAG domain which is
required for Hsc70-binding (Figure 3B). Furthermore,
flowing of various control proteins such as GST, BSA and
Bcl-XL over the BAG-1 (beginning at residue 116 of SEQ ID
NO:2), BAG-2 (SEQ ID NO:4), or BAG-3 (SEQ ID NO:6) chips
resulted in negligible interaction. These results further
demonstrate the specificity with which BAG-family proteins
interact with and bind to Hsc70.

The rates of Hsc70 binding to BAG-1 (beginning at
residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID NO:4), and BAG-3
(SEQ ID NO:6) proteins were similar, following pseudo
first-order kinetics with estimated association rate
constants (k_a) of 2.1, 2.1 and 2.4 x 10^5 M^-1 sec^-1,
respectively. After allowing binding of Hsc70 to
immobilized BAG-1 (beginning at residue 116 of SEQ ID
NO:2), BAG-2 (SEQ ID NO:4), or BAG-3 (SEQ ID NO:6) to reach
plateau levels, the chaperone was removed from the flow
solution and the dissociation rate was monitored. BAG-1
(beginning at residue 116 of SEQ ID NO:2) and BAG-2 (SEQ ID
NO:4) exhibited similar dissociation rates, with relatively
slow loss of Hsc70 from the chip surface, resulting in
estimated dissociation rate constants (k_d) of 3.0 and 5.0 x
10^-4 sec^-1, respectively (see Figure 3B). In contrast, Hsc70
dissociated more rapidly from biosensor chips containing
BAG-3 (see Figure 3B), yielding an estimated k_d of 1.7 x 10^-3
sec^-1. From the kinetic data, the apparent affinities (K_D
= k_d/k_a) were calculated for binding of Hsc70 to BAG-1
(beginning at residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID
NO:4), and BAG-3 (SEQ ID NO:6) and were estimated to equal
about K_D = 1.4 nM, K_D = 2.4 nM, and K_D = 7.4 nM, respectively.
These results demonstrate that the interactions of BAG-family
proteins with Hsc70 occur with apparent affinities
sufficient for physiological relevance.
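
The relationship between the kinetic constants and the apparent affinities can be illustrated with a short calculation. The Python sketch below is illustrative only and is not part of the BIAevaluation analysis: the names RATE_CONSTANTS, apparent_kd_nM and complex_half_life_s are assumptions introduced here, and the inputs are the rounded rate constants quoted in this example. Because the inputs are rounded, the value recomputed for BAG-3 (about 7.1 nM) differs slightly from the reported estimate of about 7.4 nM. The half-life calculation additionally assumes simple first-order dissociation.

```python
import math

# Rounded rate constants quoted in the text:
# k_a in 1/(M*sec), k_d in 1/sec.
RATE_CONSTANTS = {
    "BAG-1": (2.1e5, 3.0e-4),
    "BAG-2": (2.1e5, 5.0e-4),
    "BAG-3": (2.4e5, 1.7e-3),
}


def apparent_kd_nM(k_a, k_d):
    """Apparent equilibrium dissociation constant K_D = k_d / k_a, in nM."""
    return k_d / k_a * 1e9


def complex_half_life_s(k_d):
    """Half-life of the complex, assuming first-order dissociation: ln(2) / k_d."""
    return math.log(2) / k_d


if __name__ == "__main__":
    for name, (k_a, k_d) in RATE_CONSTANTS.items():
        print(
            f"{name}: K_D ~ {apparent_kd_nM(k_a, k_d):.1f} nM, "
            f"half-life ~ {complex_half_life_s(k_d):.0f} sec"
        )
```

Under these assumptions, the slower dissociation of Hsc70 from BAG-1 and BAG-2 corresponds to a longer-lived complex than for BAG-3, which is the qualitative difference described above.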

EXAMPLE III



BAG-family proteins inhibit 
Hsp70/Hsc70-dependent protein folding 



This example demonstrates that BAG-2 (SEQ ID
NO:4) and BAG-3 (SEQ ID NO:6) proteins inhibit
Hsp70/Hsc70-dependent refolding of denatured proteins similarly
to a BAG-1 (beginning at residue 116 of SEQ ID NO:2) protein.



The effects of BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ
ID NO:6) protein on Hsp70/Hsc70-dependent protein refolding
were determined using in vitro protein refolding assays
similar to those described previously by Takayama et al.,
supra, 1998; Terada et al., J. Cell Biol., 139:1089-1095,
1997, which are incorporated herein by reference. Briefly,
luciferase (20 µM) was denatured in 25 mM Hepes-KOH, pH 7.2,
50 mM potassium acetate, 5 mM DTT, 6 M guanidine
hydrochloride at ~25°C for 1 h. Denatured luciferase was
diluted 1:40 into 25 mM Hepes-KOH, pH 7.2, 50 mM potassium
acetate, 5 mM DTT. Hsc70 (1.8 µM), DnaJ (StressGen, Inc.)
(0.9 µM), and various purified recombinant proteins as
indicated were added to refolding buffer (30 mM Hepes-KOH,
pH 7.6, 120 mM potassium acetate, 3 mM magnesium acetate, 2
mM DTT, 2.5 mM ATP) with 0.2 volume of diluted denatured
luciferase to a final concentration of 0.1 µM. Luciferase
activity was measured after 1.5 h incubation at 35°C.

The combination of Hsc70 and DnaJ resulted in
ATP-dependent refolding of chemically denatured firefly
luciferase, with function of over half the denatured enzyme
restored in a 90 minute reaction, as monitored by a
chemiluminescence assay. In contrast, neither Hsc70 nor
DnaJ alone was able to induce substantial refolding of
denatured luciferase. Furthermore, little spontaneous
restoration of luciferase activity was observed with
control proteins, BSA, GST or Bcl-XL (see Figure 4A).
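
The dilution steps in the refolding protocol above can be checked with simple arithmetic. The following Python sketch is illustrative only: the function name and parameters are assumptions introduced here, "diluted 1:40" is read as a 40-fold dilution, and "0.2 volume" is read as one fifth of the final reaction volume. Under that reading, the 20 µM denatured stock yields the stated final luciferase concentration of 0.1 µM, and the same factors give the carry-over of guanidine hydrochloride into the reaction.

```python
def after_dilution(stock_conc, first_dilution_fold=40.0, volume_fraction=0.2):
    """Concentration after a fold-dilution followed by addition as a
    fraction of the final reaction volume (same units as stock_conc)."""
    diluted = stock_conc / first_dilution_fold  # 1:40 dilution step
    return diluted * volume_fraction            # 0.2 volume of the reaction


if __name__ == "__main__":
    luciferase_uM = after_dilution(20.0)   # 20 µM denatured luciferase stock
    guanidine_M = after_dilution(6.0)      # 6 M guanidine hydrochloride
    print(f"final luciferase: {luciferase_uM:.2f} µM")
    print(f"residual guanidine hydrochloride: {guanidine_M * 1000:.0f} mM")
```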

Addition of recombinant purified BAG-1 (beginning
at residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID NO:4), or
BAG-3 (SEQ ID NO:6) to the above assays in amounts
equimolar to Hsc70 (1.8 µM) resulted in striking inhibition
of luciferase refolding. BAG-2 (SEQ ID NO:4) and BAG-3
(SEQ ID NO:6) displayed somewhat greater inhibitory
activity than BAG-1 (beginning at residue 116 of SEQ ID
NO:2) as shown in Figure 4A. In contrast, the BAG-1(ΔC)
protein, which fails to bind Hsc70, as well as several other
control proteins, had no effect on luciferase refolding.

In an additional refolding assay, described
previously by Minami et al. (J. Biol. Chem., 271:19617-24,
1996), purified Hsc70 and human DnaJ homolog Hdj-1 (Hsp40)
were used with additional cofactors provided in
reticulocyte lysates (5% v:v) to produce a system capable
of refolding denatured luciferase. Briefly, additional
cofactors included recombinant luciferase (Promega;
QuantiLum TM) that had been heat denatured at 42°C for 10
min, 1.8 µM Hsc70 (Sigma; purified from bovine brain), 0.9
µM Hsp40, and various recombinant purified proteins.
Luciferase activity was measured (Promega luciferase assay
kit) using a luminometer (EG&G Berthold, MicroLumat
luminometer, Model #LB96P). All results were normalized
relative to non-denatured luciferase that had been
subjected to the same conditions. Control reactions
lacking ATP, Hsc70, or Hsp40 resulted in negligible
luciferase refolding.
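
The normalization described above, in which each result is expressed relative to non-denatured luciferase treated under the same conditions, can be sketched as follows. This Python fragment is a minimal illustration rather than the procedure actually used: the function name, the optional background subtraction, and the luminometer readings in the usage lines are all hypothetical.

```python
def percent_refolding(sample_rlu, native_control_rlu, background_rlu=0.0):
    """Luciferase activity of a refolding reaction as a percentage of the
    non-denatured control measured under the same conditions.

    background_rlu is an optional blank reading (an assumption here; the
    text does not state whether a blank was subtracted).
    """
    control = native_control_rlu - background_rlu
    if control <= 0:
        raise ValueError("native control signal must exceed background")
    return 100.0 * (sample_rlu - background_rlu) / control


if __name__ == "__main__":
    # Hypothetical relative light unit (RLU) readings, for illustration only.
    native = 10000.0
    readings = {"Hsc70 + Hsp40": 5500.0, "Hsc70 + Hsp40 + BAG protein": 900.0}
    for label, rlu in readings.items():
        print(f"{label}: {percent_refolding(rlu, native):.0f}% of native activity")
```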

Various amounts of purified BAG-1 (beginning at
residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID NO:4), or BAG-3
(SEQ ID NO:6), relative to amounts of Hsc70, were used in
the above-described protein refolding assay. Addition of
BAG-family proteins resulted in a concentration-dependent
inhibition of Hsc70 chaperone activity. Furthermore, the
BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ ID NO:6) inhibition of
Hsc70 chaperone activity was demonstrated to be as potent
as that observed for BAG-1 (beginning at residue 116 of SEQ
ID NO:2). In contrast, the BAG-1(ΔC) mutant as well as
other control proteins did not suppress Hsc70-mediated
refolding of denatured luciferase. These results indicate
that BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ ID NO:6) can
inhibit Hsc70/Hsp70-dependent protein refolding activity to
the same extent as BAG-1 (beginning at residue 116 of SEQ
ID NO:2).

B. BAG competes with Hip for binding to Hsc70

It is known that BAG-1 competes with Hip for
binding to Hsc70, with these proteins exerting opposite
effects on Hsc70-mediated protein refolding (Hohfeld, J.,
and Jentsch, S., EMBO J., 16:6209-6216, 1997, which is
incorporated herein by reference). In order to determine
whether BAG-2 (SEQ ID NO:4) and BAG-3 (SEQ ID NO:6) also
compete with Hip for binding to Hsc70, refolding assays
were performed as described above in the presence of Hip
protein.

Hip was purified as His6-protein. The fusion
protein was induced from pET28-Hip (V. Prapapanich et al.,
Mol. Cell. Biol., 18:944-952, 1998, which is incorporated
herein by reference) with 0.1 mM IPTG at 25°C for 6 h in BL21
cells. Cells from 1 L of culture were resuspended into 50
ml of 50 mM phosphate buffer (pH 6.8), 150 mM NaCl, and 1%
(v/v) Tween-20 and then incubated with 0.5 mg/ml lysozyme
at 25°C for 0.5 h, followed by sonication. After
centrifugation at 27,500g for 10 min, the resulting
supernatant was mixed with 15 ml nickel resin (Qiagen,
Inc.) at 4°C for 3 h with 25 mM imidazole. The resin was
then washed with 50 mM phosphate buffer (pH 6.8), 25 mM
imidazole, 150 mM NaCl and 0.1% Tween-20 until the OD280nm
reached a value of <0.01. His6-Hip protein was eluted with
250 mM imidazole in washing buffer (Qiagen, Inc.) and
purified on Mono Q (HR10/10, Pharmacia) by FPLC using a
linear gradient of 0.5 M NaCl at pH 8.0, followed by
dialysis in chaperone assay buffer.

In the refolding assay reactions, addition of
purified Hip at equimolar concentrations relative to BAG-1
(beginning at residue 116 of SEQ ID NO:2), BAG-2 (SEQ ID
NO:4), or BAG-3 (SEQ ID NO:6) (1.8 µM) completely negated
the inhibitory effects of the BAG-family proteins on
refolding of denatured luciferase (see Figure 4C). These
results demonstrate that the suppression of Hsc70 chaperone
activity by BAG-family proteins is reversible, and that Hip
antagonizes the effects of not only BAG-1 (beginning at
residue 116 of SEQ ID NO:2), but also of BAG-2 (SEQ ID
NO:4) and BAG-3 (SEQ ID NO:6).

In summary, these results demonstrate that
BAG-family proteins all contain a conserved BAG domain near
their C-terminus that binds Hsc70/Hsp70; and that human
BAG-family proteins can bind with high affinity to the
ATPase domain of Hsc70 and inhibit its chaperone activity
through a Hip-repressible mechanism.

EXAMPLE IV 

EXPANDED NUCLEIC ACID AND AMINO ACID SEQUENCES
FOR HUMAN BAG-3, BAG-4 AND BAG-5

Following the procedures disclosed herein, the
nucleic acid and amino acid sequences of human BAG-3,
BAG-4 and BAG-5 were further expanded. The expanded
sequences for BAG-3, BAG-4 and BAG-5 are shown in
Figures 15, 16 and 17, respectively, with their respective
sequence identification numbers, "SEQ ID NO"s.

We claim:

1. A compound of the formula,
R^N-R^1X^1R^2X^2R^3X^3R^4X^4R^5X^5R^6X^6R^7X^7X^8R^9X^9R^10X^10R^11X^11-R^C

wherein,

R^N is a group of about 1 to 552 independently
selected amino acids;
R^1 is a group of 3 independently selected amino
acids;
X^1 is an amino acid with a charged or uncharged
R group;
R^2 is a group of 7 independently selected amino
acids;
X^2 is an amino acid with a charged R group;
R^3 is a group of 5 independently selected amino
acids;
X^3 is an amino acid with an apolar R group;
R^4 is a group of 3 independently selected amino
acids;
X^4 is an amino acid with charged R group;
R^5 is a single independently selected amino acid;
X^5 is an amino acid with apolar or uncharged R
group;
R^6 is a group of 15 independently selected amino
acids;
X^6 is an amino acid with a charged or uncharged
R group;
R^7 is a group of 2 independently selected amino
acids;
X^7 is an amino acid with a charged R group;
X^8 is an amino acid with a charged R group;
R^9 is a group of 2 independently selected amino
acids;
X^9 is an amino acid with an apolar R group;
R^10 is a group of 3 independently selected amino
acids;
X^10 is an amino acid with an uncharged R group;
R^11 is a group of 2 independently selected amino
acids;
X^11 is an amino acid with an apolar R group; and
R^C is a group of about 1 to 100 independently
selected amino acids.



2. A substantially purified nucleic acid
molecule having a nucleotide sequence corresponding to or
complementary to at least 20 nucleotides from a nucleotide
sequence selected from the group consisting of (SEQ ID
NO:1), (SEQ ID NO:3), (SEQ ID NO:5), (SEQ ID NO:7), (SEQ ID
NO:9), (SEQ ID NO:19), (SEQ ID NO:21) and (SEQ ID NO:23).

3. The nucleic acid of claim 2 having a
nucleotide sequence corresponding to or complementary to a
nucleotide sequence that encodes a functionally active BAG
family protein selected from the group consisting of (SEQ
ID NO:2), (SEQ ID NO:4), (SEQ ID NO:6), (SEQ ID NO:8), (SEQ
ID NO:10), (SEQ ID NO:20), (SEQ ID NO:22) and (SEQ ID
NO:24).

4. The nucleic acid of claim 3 selected from
the group consisting of (SEQ ID NO:1), (SEQ ID NO:3), (SEQ
ID NO:5), (SEQ ID NO:7), (SEQ ID NO:9), (SEQ ID NO:19),
(SEQ ID NO:21) and (SEQ ID NO:23).

5. The nucleic acid of claim 3 complementary to
a nucleotide sequence that encodes a functionally active
BAG protein selected from the group consisting of (SEQ ID
NO:2), (SEQ ID NO:4), (SEQ ID NO:6), (SEQ ID NO:8), (SEQ ID
NO:10), (SEQ ID NO:20), (SEQ ID NO:22) and (SEQ ID NO:24).

6. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:3).

7. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:5).

8. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:7).

9. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:9).

10. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:19).

11. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:21).

12. A substantially purified nucleic acid
molecule having the nucleotide sequence of (SEQ ID NO:23).

13. A substantially purified BAG family protein
encoded by the nucleic acid molecule of claim 1.

14. A substantially purified BAG family protein
comprising of the amino acid sequence selected from the
group consisting of (SEQ ID NO:2), (SEQ ID NO:4), (SEQ ID
NO:6), (SEQ ID NO:8), (SEQ ID NO:10), (SEQ ID NO:20), (SEQ
ID NO:22) and (SEQ ID NO:24) or a fragment, a derivative or
a mimetic thereof.



15. A substantially purified protein
corresponding to the amino acid sequence of 157 to 204 of
(SEQ ID NO:2).

16. A substantially purified protein
corresponding to the amino acid sequence of 272 to 319 of
(SEQ ID NO:2).

17. A substantially purified protein
corresponding to the amino acid sequence of 164 to 211 of
(SEQ ID NO:4).

18. A substantially purified protein
corresponding to the amino acid sequence of 418 to 510 of
(SEQ ID NO:20).

19. A substantially purified protein
corresponding to the amino acid sequence of 378 to 457 of
(SEQ ID NO:22).

20. A substantially purified protein
corresponding to the amino acid sequence of 6 to 97 of (SEQ
ID NO:24).

21. A substantially purified protein
corresponding to the amino acid sequence of 180 to 257 of
(SEQ ID NO:24).

22. A substantially purified protein
corresponding to the amino acid sequence of 272 to 349 of
(SEQ ID NO:24).

23. A substantially purified protein
corresponding to the amino acid sequence of 362 to 444 of
(SEQ ID NO:24).



24. A pharmaceutical composition comprising a
nucleic acid molecule of claim 1 useful for modulating
tumor cell proliferation, cell migration and metastasis,
and steroid hormone receptor function.

25. A method of modulating tumor cell
proliferation, cell migration and metastasis, and steroid
hormone receptor function by administering a nucleic acid
molecule of claim 1.

26. A pharmaceutical composition comprising a
substantially purified BAG family protein comprising of the
amino acid sequence selected from the group consisting of
(SEQ ID NO:2), (SEQ ID NO:4), (SEQ ID NO:6), (SEQ ID NO:8),
(SEQ ID NO:10), (SEQ ID NO:20), (SEQ ID NO:22) and (SEQ ID
NO:24), or a fragment, a derivative or a mimetic thereof,
useful for modulating tumor cell proliferation, cell
migration and metastasis, and steroid hormone receptor
function.

27. A method of modulating tumor cell
proliferation by administering a pharmaceutical composition
of claim 26.

28. A method of modulating cell migration and
metastasis by administering a pharmaceutical composition of
claim 26.

29. A method of modulating steroid hormone
receptor function by administering a pharmaceutical
composition of claim 26.

30. A substantially purified antibody that
specifically binds to a BAG family protein of claim 14.

31. The antibody of claim 30, wherein said
antibody is a monoclonal antibody.

32. A method for detecting the presence of a BAG
family protein in a sample, comprising the steps of:

a. obtaining the sample;

b. adding to said sample an antibody of claim 11
under suitable conditions for the
binding of said antibody with the BAG
family protein; and

c. detecting said bound BAG family
protein.

33. A method for detecting the presence of a
first nucleic acid molecule that encodes a BAG family
protein in a sample, comprising the steps of:

a. obtaining the sample;

b. adding to said sample a second nucleic
acid molecule capable of hybridizing
with said first nucleic acid molecule
under suitable conditions for the
binding of said second nucleic acid
molecule with said first nucleic acid
molecule; and

c. detecting said hybridized first and
second nucleic acid molecules.

34. A method of determining the risk of
metastatic spread of cancer or prognosis of cancer patients
by determining the level of expression of a BAG family
protein.

SEQUENCE LISTING

<110> Reed, John C.
      Takayama, Shinichi
      The Burnham Institute

<120> Novel BAG Proteins and Nucleic Acid Molecules Encoding
      Them

<130> FP-LJ 3646

<140>
<141>

<150> 09/150,489
<151> 1998-09-09

<160> 24

<170> PatentIn Ver. 2.0

<210> 1
<211> 1291
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (46)..(1080)

<400> 1

acgccgcgct cagcttccat cgctgggcgg tcaacaagtg cgggc ctg gct cag cgc 57
Leu Ala Gln Arg
1

ggg ggg gcg cgg aga ccg cga ggc gac cgg gag cgg ctg ggt tcc cgg 105
Gly Gly Ala Arg Arg Pro Arg Gly Asp Arg Glu Arg Leu Gly Ser Arg
5 10 15 20

ctg cgc gcc ctt cgg cca ggc cgg gag ccg cgc cag tcg gag ccc ccg 153
Leu Arg Ala Leu Arg Pro Gly Arg Glu Pro Arg Gln Ser Glu Pro Pro
25 30 35

gcc cag cgt ggt ccg cct ccc tct cgg cgt cca cct gcc cgg agt act 201
Ala Gln Arg Gly Pro Pro Pro Ser Arg Arg Pro Pro Ala Arg Ser Thr
40 45 50

gcc agc ggg cat gac cga ccc acc agg ggc gcc gcc gcc ggc gct cgc 249
Ala Ser Gly His Asp Arg Pro Thr Arg Gly Ala Ala Ala Gly Ala Arg
55 60 65

agg ccg cgg atg aag aag aaa acc cgg cgc cgc tcg acc cgg agc gag 297
Arg Pro Arg Met Lys Lys Lys Thr Arg Arg Arg Ser Thr Arg Ser Glu
70 75 80

gag ttg acc cgg agc gag gag ttg acc ctg agt gag gaa gcg acc tgg 345
Glu Leu Thr Arg Ser Glu Glu Leu Thr Leu Ser Glu Glu Ala Thr Trp
85 90 95 100

agt gaa gag gcg acc cag agt gag gag gcg acc cag ggc gaa gag atg 393
Ser Glu Glu Ala Thr Gln Ser Glu Glu Ala Thr Gln Gly Glu Glu Met
105 110 115

aat cgg agc cag gag gtg acc cgg gac gag gag tcg acc cgg agc gag 441
Asn Arg Ser Gln Glu Val Thr Arg Asp Glu Glu Ser Thr Arg Ser Glu
120 125 130

gag gtg acc agg gag gaa atg gcg gca gct ggg ctc acc gtg act gtc 489
Glu Val Thr Arg Glu Glu Met Ala Ala Ala Gly Leu Thr Val Thr Val
135 140 145

acc cac agc aat gag aag cac gac ctt cat gtt acc tcc cag cag ggc 537
Thr His Ser Asn Glu Lys His Asp Leu His Val Thr Ser Gln Gln Gly
150 155 160

agc agt gaa cca gtt gtc caa gac ctg gcc cag gtt gtt gaa gag gtc 585
Ser Ser Glu Pro Val Val Gln Asp Leu Ala Gln Val Val Glu Glu Val
165 170 175 180

ata ggg gtt cca cag tct ttt cag aaa ctc ata ttt aag gga aaa tct 633
Ile Gly Val Pro Gln Ser Phe Gln Lys Leu Ile Phe Lys Gly Lys Ser
185 190 195

ctg aag gaa atg gaa aca ccg ttg tca gca ctt gga ata caa gat ggt 681
Leu Lys Glu Met Glu Thr Pro Leu Ser Ala Leu Gly Ile Gln Asp Gly
200 205 210

tgc cgg gtc atg tta att ggg aaa aag aac agt cca cag gaa gag gtt 729
Cys Arg Val Met Leu Ile Gly Lys Lys Asn Ser Pro Gln Glu Glu Val
215 220 225

gaa cta aag aag ttg aaa cat ttg gag aag tct gtg gag aag ata gct 777
Glu Leu Lys Lys Leu Lys His Leu Glu Lys Ser Val Glu Lys Ile Ala
230 235 240

gac cac ctg gaa gag ttg aat aaa gag ctt act gga atc cag cag ggt 825
Asp Gln Leu Glu Glu Leu Asn Lys Glu Leu Thr Gly Ile Gln Gln Gly
245 250 255 260

ttt ctg ccc aag gat ttg caa gct gaa gct ctc tgc aaa ctt gat agg 873
Phe Leu Pro Lys Asp Leu Gln Ala Glu Ala Leu Cys Lys Leu Asp Arg
265 270 275

aga gta aaa gcc aca ata gag cag ttt atg aag atc ttg gag gag att 921
Arg Val Lys Ala Thr Ile Glu Gln Phe Met Lys Ile Leu Glu Glu Ile
280 285 290

gac aca ctg atc ctg cca gaa aat ttc aaa gac agt aga ttg aaa agg 969
Asp Thr Leu Ile Leu Pro Glu Asn Phe Lys Asp Ser Arg Leu Lys Arg
295 300 305

aaa ggc ttg gta aaa aag gtt cag gca ttc cta gcc gag tgt gac aca 1017
Lys Gly Leu Val Lys Lys Val Gln Ala Phe Leu Ala Glu Cys Asp Thr
310 315 320

gtg gag cag aac atc tgc cag gag act gag cgg ctg cag tct aca aac 1065
Val Glu Gln Asn Ile Cys Gln Glu Thr Glu Arg Leu Gln Ser Thr Asn
325 330 335 340

ttt gcc ctg gcc gag tgaggtgtag cagaaaaagg ctgtgctgcc ctgaagaatg 1120
Phe Ala Leu Ala Glu
345

gcgccaccag ctctgccgtc tctggatcgg aatttacctg atttcttcag ggctgctggg 1180
ggcaactggc catttgccaa ttttcctact ctcacactgg ttctcaatga aaaatagtgt 1240
ctttgtgatt tgagtaaagc tcctattctg tttttcacaa aaaaaaaaaa a 1291



<210> 2
<211> 345
<212> PRT
<213> Homo sapiens

<400> 2
Leu Ala Gln Arg Gly Gly Ala Arg Arg Pro Arg Gly Asp Arg Glu Arg
1 5 10 15

Leu Gly Ser Arg Leu Arg Ala Leu Arg Pro Gly Arg Glu Pro Arg Gln
20 25 30

Ser Glu Pro Pro Ala Gln Arg Gly Pro Pro Pro Ser Arg Arg Pro Pro
35 40 45

Ala Arg Ser Thr Ala Ser Gly His Asp Arg Pro Thr Arg Gly Ala Ala
50 55 60

Ala Gly Ala Arg Arg Pro Arg Met Lys Lys Lys Thr Arg Arg Arg Ser
65 70 75 80

Thr Arg Ser Glu Glu Leu Thr Arg Ser Glu Glu Leu Thr Leu Ser Glu
85 90 95

Glu Ala Thr Trp Ser Glu Glu Ala Thr Gln Ser Glu Glu Ala Thr Gln
100 105 110

Gly Glu Glu Met Asn Arg Ser Gln Glu Val Thr Arg Asp Glu Glu Ser
115 120 125

Thr Arg Ser Glu Glu Val Thr Arg Glu Glu Met Ala Ala Ala Gly Leu
130 135 140

Thr Val Thr Val Thr His Ser Asn Glu Lys His Asp Leu His Val Thr
145 150 155 160

Ser Gln Gln Gly Ser Ser Glu Pro Val Val Gln Asp Leu Ala Gln Val
165 170 175

Val Glu Glu Val Ile Gly Val Pro Gln Ser Phe Gln Lys Leu Ile Phe
180 185 190

Lys Gly Lys Ser Leu Lys Glu Met Glu Thr Pro Leu Ser Ala Leu Gly
195 200 205

Ile Gln Asp Gly Cys Arg Val Met Leu Ile Gly Lys Lys Asn Ser Pro
210 215 220

Gln Glu Glu Val Glu Leu Lys Lys Leu Lys His Leu Glu Lys Ser Val
225 230 235 240

Glu Lys Ile Ala Asp Gln Leu Glu Glu Leu Asn Lys Glu Leu Thr Gly
245 250 255

Ile Gln Gln Gly Phe Leu Pro Lys Asp Leu Gln Ala Glu Ala Leu Cys
260 265 270

Lys Leu Asp Arg Arg Val Lys Ala Thr Ile Glu Gln Phe Met Lys Ile
275 280 285

Leu Glu Glu Ile Asp Thr Leu Ile Leu Pro Glu Asn Phe Lys Asp Ser
290 295 300

Arg Leu Lys Arg Lys Gly Leu Val Lys Lys Val Gln Ala Phe Leu Ala
305 310 315 320

Glu Cys Asp Thr Val Glu Gln Asn Ile Cys Gln Glu Thr Glu Arg Leu
325 330 335

Gln Ser Thr Asn Phe Ala Leu Ala Glu
340 345



<210> 3
<211> 1179
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (160)..(792)

<400> 3
gcagccgcgg tgtcgcgaag tcctcccggg ttgcccccgc ggcgtcagag ggagggcggg 60
cgccgcgttg gtgacggcga ccctgcagcc caaggagcgc tccactcgct gccgccggag 120
ggccggtgac ctcttggcta ccccgcgtcg gaggcttag atg gct cag gcg aag 174
Met Ala Gln Ala Lys
1 5

atc aac gct aaa gcc aac gag ggg cgc ttc tgc cgc tcc tcc tcc atg 222
Ile Asn Ala Lys Ala Asn Glu Gly Arg Phe Cys Arg Ser Ser Ser Met
10 15 20

gct gac cgc tcc agc cgc ctg ctg gag agc ctg gac cag ctg gag ctc 270
Ala Asp Arg Ser Ser Arg Leu Leu Glu Ser Leu Asp Gln Leu Glu Leu
25 30 35

agg gtt gaa gct ttg aga gaa gca gca act gct gtt gag caa gag aaa 318
Arg Val Glu Ala Leu Arg Glu Ala Ala Thr Ala Val Glu Gln Glu Lys
40 45 50

gaa atc ctt ctg gaa atg atc cac agt atc caa aat agc cag gac atg 366
Glu Ile Leu Leu Glu Met Ile His Ser Ile Gln Asn Ser Gln Asp Met
55 60 65

agg cag atc agt gac gga gaa aga gaa gaa tta aat ctg act gca aac 414
Arg Gln Ile Ser Asp Gly Glu Arg Glu Glu Leu Asn Leu Thr Ala Asn
70 75 80 85

cgt ttg atg gga aga act ctc acc gtt gaa gtg tca gta gaa aca att 462
Arg Leu Met Gly Arg Thr Leu Thr Val Glu Val Ser Val Glu Thr Ile
90 95 100

aga aac ccc cag cag caa gaa tcc cta aag cat gcc aca agg att att 510
Arg Asn Pro Gln Gln Gln Glu Ser Leu Lys His Ala Thr Arg Ile Ile
105 110 115

gat gag gtg gtc aat aag ttt ctg gat gat ttg gga aat gcc aag agt 558
Asp Glu Val Val Asn Lys Phe Leu Asp Asp Leu Gly Asn Ala Lys Ser
120 125 130

cat tta atg tcg ctc tac agt gca tgt tca tct gag gtg cca cat ggg 606
His Leu Met Ser Leu Tyr Ser Ala Cys Ser Ser Glu Val Pro His Gly
135 140 145

cca gtt gat cag aag ttt caa tcc ata gta att ggc tgt gct ctt gaa 654
Pro Val Asp Gln Lys Phe Gln Ser Ile Val Ile Gly Cys Ala Leu Glu
150 155 160 165

gat cag aag aaa att aag aga aga tta gag act ctg ctt aga aat att 702
Asp Gln Lys Lys Ile Lys Arg Arg Leu Glu Thr Leu Leu Arg Asn Ile
170 175 180

gaa aac tct gac aag gcc atc aag cta tta gag cat tct aaa gga gct 750
Glu Asn Ser Asp Lys Ala Ile Lys Leu Leu Glu His Ser Lys Gly Ala
185 190 195

ggt tcc aaa act ctg caa caa aat gct gaa agc aga ttc aat 792
Gly Ser Lys Thr Leu Gln Gln Asn Ala Glu Ser Arg Phe Asn
200 205 210

tagtcttcaa acctaagagc atttacacaa tacacaaggt gtaaaaatga taaaatacta 852
ttttaattga taactagttc tttgttaggt ataaccactt agttgacact gatagttgtt 912
tcagatgagg aaaatattcc atcaagtatc ttcagttttg tgaataacaa aactagcaat 972
attttaatta tctatctaga gattttttag attgaattct tgtcttgtac taggatctag 1032
catatttcac tattctgtgg atgaatacat agtttgtggg gaaaacaaac gttcagctag 1092
gggcaaaaag catgactgct ttttcctgtc tggcatggaa tcacgcagtc accttgggca 1152
tttagtttac tagaaattct ttactgg 1179



<210> 4
<211> 211
<212> PRT
<213> Homo sapiens

<400> 4
Met Ala Gln Ala Lys Ile Asn Ala Lys Ala Asn Glu Gly Arg Phe Cys
1 5 10 15

Arg Ser Ser Ser Met Ala Asp Arg Ser Ser Arg Leu Leu Glu Ser Leu
20 25 30

Asp Gln Leu Glu Leu Arg Val Glu Ala Leu Arg Glu Ala Ala Thr Ala
35 40 45

Val Glu Gln Glu Lys Glu Ile Leu Leu Glu Met Ile His Ser Ile Gln
50 55 60

Asn Ser Gln Asp Met Arg Gln Ile Ser Asp Gly Glu Arg Glu Glu Leu
65 70 75 80

Asn Leu Thr Ala Asn Arg Leu Met Gly Arg Thr Leu Thr Val Glu Val
85 90 95

Ser Val Glu Thr Ile Arg Asn Pro Gln Gln Gln Glu Ser Leu Lys His
100 105 110

Ala Thr Arg Ile Ile Asp Glu Val Val Asn Lys Phe Leu Asp Asp Leu
115 120 125

Gly Asn Ala Lys Ser His Leu Met Ser Leu Tyr Ser Ala Cys Ser Ser
130 135 140

Glu Val Pro His Gly Pro Val Asp Gln Lys Phe Gln Ser Ile Val Ile
145 150 155 160

Gly Cys Ala Leu Glu Asp Gln Lys Lys Ile Lys Arg Arg Leu Glu Thr
165 170 175

Leu Leu Arg Asn Ile Glu Asn Ser Asp Lys Ala Ile Lys Leu Leu Glu
180 185 190

His Ser Lys Gly Ala Gly Ser Lys Thr Leu Gln Gln Asn Ala Glu Ser
195 200 205

Arg Phe Asn 
210 
<210> 5

<211> 2526

<212> DNA

<213> Homo sapiens

<220>

<221> CDS

<222> (1) .

<400> 5



gcg gag ctc cgc atc caa ccc cgg gcc gcg gcc aac ttc tct gga ctg 48
Ala Glu Leu Arg Ile Gln Pro Arg Ala Ala Ala Asn Phe Ser Gly Leu
1 5 10 15

gac cag aag ttt cta gcc ggc cag ttg cta cct ccc ttt atc tcc tcc 96
Asp Gln Lys Phe Leu Ala Gly Gln Leu Leu Pro Pro Phe Ile Ser Ser
20 25 30

ttc ccc tct ggc agc gag gag gct att tcc aga cac ttc cac ccc tct 144
Phe Pro Ser Gly Ser Glu Glu Ala Ile Ser Arg His Phe His Pro Ser
35 40 45

ctg gcc acg tca ccc ccg cct tta att cat aaa ggt gcc cgg cgc cgg 192
Leu Ala Thr Ser Pro Pro Pro Leu Ile His Lys Gly Ala Arg Arg Arg
50 55 60

ctt ccc gga cac gtc ggc ggc gga gag ggg ccc acg gcg gcg gcc cgg 240
Leu Pro Gly His Val Gly Gly Gly Glu Gly Pro Thr Ala Ala Ala Arg
65 70 75 80

cca gag act cgg cgc ccg gag cca gcg ccc cgc acc cgc gcc cca gcg 288
Pro Glu Thr Arg Arg Pro Glu Pro Ala Pro Arg Thr Arg Ala Pro Ala
85 90 95

ggc aga ccc caa ccc agc atg agc gcc gcc acc cac tcg ccc atg atg 336
Gly Arg Pro Gln Pro Ser Met Ser Ala Ala Thr His Ser Pro Met Met
100 105 110

cag gtg gcg tcc ggc aac ggt gac cgc gac cct ttg ccc ccc gga tgg 384
Gln Val Ala Ser Gly Asn Gly Asp Arg Asp Pro Leu Pro Pro Gly Trp
115 120 125

gag atc aag atc gac ccg cag acc ggc tgg ccc ttc ttc gtg gac cac 432
Glu Ile Lys Ile Asp Pro Gln Thr Gly Trp Pro Phe Phe Val Asp His
130 135 140

aac agc cgc acc act acg tgg aac gac ccg cgc gtg ccc tct gag ggc 480
Asn Ser Arg Thr Thr Thr Trp Asn Asp Pro Arg Val Pro Ser Glu Gly
145 150 155 160

ccc aag gag act cca tcc tct gcc aat ggc cct tcc cgg gag ggc tct 528
Pro Lys Glu Thr Pro Ser Ser Ala Asn Gly Pro Ser Arg Glu Gly Ser
165 170 175

agg ctg ccg cct gct agg gaa ggc cac cct gtg tac ccc cag ctc cga 576
Arg Leu Pro Pro Ala Arg Glu Gly His Pro Val Tyr Pro Gln Leu Arg
180 185 190

cca ggc tac att ccc att cct gtg ctc cat gaa ggc gct gag aac cgg 624
Pro Gly Tyr Ile Pro Ile Pro Val Leu His Glu Gly Ala Glu Asn Arg
195 200 205

cag gtg cac cct ttc cat gtc tat ccc cag cct ggg atg cag cga ttc 672
Gln Val His Pro Phe His Val Tyr Pro Gln Pro Gly Met Gln Arg Phe
210 215 220

cga act gag gcg gca gca gcg gct cct cag agg tcc cag tca cct ctg 720
Arg Thr Glu Ala Ala Ala Ala Ala Pro Gln Arg Ser Gln Ser Pro Leu
225 230 235 240

cgg ggc atg cca gaa acc act cag cca gat aaa cag tgt gga cag gtg 768
Arg Gly Met Pro Glu Thr Thr Gln Pro Asp Lys Gln Cys Gly Gln Val
245 250 255

gca gcg gcg gcg gca gcc cag ccc cca gcc tcc cac gga cct gag cgg 816
Ala Ala Ala Ala Ala Ala Gln Pro Pro Ala Ser His Gly Pro Glu Arg
260 265 270

tcc cag tct cca gct gcc tct gac tgc tca tcc tca tcc tcc tcg gcc 864
Ser Gln Ser Pro Ala Ala Ser Asp Cys Ser Ser Ser Ser Ser Ser Ala
275 280 285

agc ctg cct tcc tcc ggc agg agc agc ctg ggc agt cac cag ctc ccg 912
Ser Leu Pro Ser Ser Gly Arg Ser Ser Leu Gly Ser His Gln Leu Pro
290 295 300

cgg ggg tac atc tcc att ccg gtg ata cac gag cag aac gtt acc cgg 960
Arg Gly Tyr Ile Ser Ile Pro Val Ile His Glu Gln Asn Val Thr Arg
305 310 315 320

cca gca gcc cag ccc tcc ttc cac aaa gcc cag aag acg cac tac cca 1008
Pro Ala Ala Gln Pro Ser Phe His Lys Ala Gln Lys Thr His Tyr Pro
325 330 335



gcg cag agg ggt gag tac cag acc cac cag cct gtg tac cac aag atc 1056
Ala Gln Arg Gly Glu Tyr Gln Thr His Gln Pro Val Tyr His Lys Ile
340 345 350

cag ggg gat gac tgg gag ccc cgg ccc ctg cgg gcg gca tcc ccg ttc 1104
Gln Gly Asp Asp Trp Glu Pro Arg Pro Leu Arg Ala Ala Ser Pro Phe
355 360 365

agg tca tct gtc cag ggt gca tcg agc cgg gag ggc tca cca gcc agg 1152
Arg Ser Ser Val Gln Gly Ala Ser Ser Arg Glu Gly Ser Pro Ala Arg
370 375 380

agc agc acg cca ctc cac tcc ccc tcg ccc atc cgt gtg cac acc gtg 1200
Ser Ser Thr Pro Leu His Ser Pro Ser Pro Ile Arg Val His Thr Val
385 390 395 400

gtc gac agg cct cag cag ccc atg acc cat cga gaa act gca cct gtt 1248
Val Asp Arg Pro Gln Gln Pro Met Thr His Arg Glu Thr Ala Pro Val
405 410 415

tcc cag cct gaa aac aaa cca gaa agt aag cca ggc cca gtt gga cca 1296
Ser Gln Pro Glu Asn Lys Pro Glu Ser Lys Pro Gly Pro Val Gly Pro
420 425 430

gaa ctc cct cct gga cac atc cca att caa gtg atc cgc aaa gag gtg 1344
Glu Leu Pro Pro Gly His Ile Pro Ile Gln Val Ile Arg Lys Glu Val
435 440 445

gat tct aaa cct gtt tcc cag aag ccc cca cct ccc tct gag aag gta 1392
Asp Ser Lys Pro Val Ser Gln Lys Pro Pro Pro Pro Ser Glu Lys Val
450 455 460

gag gtg aaa gtt ccc cct gct cca gtt cct tgt cct cct ccc agc cct 1440
Glu Val Lys Val Pro Pro Ala Pro Val Pro Cys Pro Pro Pro Ser Pro
465 470 475 480

ggc cct tct gct gtc ccc tct tcc ccc aag agt gtg gct aca gaa gag 1488
Gly Pro Ser Ala Val Pro Ser Ser Pro Lys Ser Val Ala Thr Glu Glu
485 490 495

agg gca gcc ccc agc act gcc cct gca gaa gct aca cct cca aaa cca 1536
Arg Ala Ala Pro Ser Thr Ala Pro Ala Glu Ala Thr Pro Pro Lys Pro
500 505 510

gga gaa gcc gag gct ccc cca aaa cat cca gga gtg ctg aaa gtg gaa 1584
Gly Glu Ala Glu Ala Pro Pro Lys His Pro Gly Val Leu Lys Val Glu
515 520 525



gcc atc ctg gag aag gtg cag ggg ctg gag cag gct gta gac aac ttt 1632
Ala Ile Leu Glu Lys Val Gln Gly Leu Glu Gln Ala Val Asp Asn Phe
530 535 540

gaa ggc aag aag act gac aaa aag tac ctg atg atc gaa gag tat ttg 1680
Glu Gly Lys Lys Thr Asp Lys Lys Tyr Leu Met Ile Glu Glu Tyr Leu
545 550 555 560

acc aaa gag ctg ctg gcc ctg gat tca gtg gac ccc gag gga cga gcc 1728
Thr Lys Glu Leu Leu Ala Leu Asp Ser Val Asp Pro Glu Gly Arg Ala
565 570 575

gat gtg cgt cag gcc agg aga gac ggt gtc agg aag gtt cag acc atc 1776
Asp Val Arg Gln Ala Arg Arg Asp Gly Val Arg Lys Val Gln Thr Ile
580 585 590

ttg gaa aaa ctt gaa cag aaa gcc att gat gtc cca ggt caa gtc cag 1824
Leu Glu Lys Leu Glu Gln Lys Ala Ile Asp Val Pro Gly Gln Val Gln
595 600 605

gtc tat gaa ctc cag ccc agc aac ctt gaa gca gat cag cca ctg cag 1872
Val Tyr Glu Leu Gln Pro Ser Asn Leu Glu Ala Asp Gln Pro Leu Gln
610 615 620

gca atc atg gag atg ggt gcc gtg gca gca gac aag ggc aag aaa aat 1920
Ala Ile Met Glu Met Gly Ala Val Ala Ala Asp Lys Gly Lys Lys Asn
625 630 635 640

gct gga aat gca gaa gat ccc cac aca gaa acc cag cag cca gaa gcc 1968
Ala Gly Asn Ala Glu Asp Pro His Thr Glu Thr Gln Gln Pro Glu Ala
645 650 655

aca gca gca gcg act tca aac ccc agc agc atg aca gac acc cct ggt 2016
Thr Ala Ala Ala Thr Ser Asn Pro Ser Ser Met Thr Asp Thr Pro Gly
660 665 670

aac cca gca gca ccg tagcctctgc cctgtaaaag tcagactcgg aaccgatgtg 2071
Asn Pro Ala Ala Pro
675

tgctttaggg attttagttg catgcatttc agagacttta ggtcagttgg ttttgattag 2131

ctgcttggta tgcagtactt gggtgaggca aacactataa agggctaaaa gggaaaatga 2191

tgcttttctt caatattctt actcttgtac aattaangaa gttgcttgtt gtttgagaag 2251

tttaaccccg ttgcttgttc tgcagccctg tcnacttggg cacccccacc acctgttagc 2311 

tgtggttgtg cactgtcttt tgtagctctg gactggaggg gtagatgggg agtcaattac 2371 

ccatcacata aala^qaaao atttatcaga aatgttgcca ttttaatgag atgattttct 2431

tcatctcata attaaaatac ctgactttag agagagtaaa atgtgccagg agccatagga 2491

atatctgta: gttggatgac tttaatgcta catttth 2528



<210> 6

<211> 677 

<212> PRT 

<213> Homo sapiens 

<400> 6

Ala Glu Leu Arg Ile Gln Pro Arg Ala Ala Ala Asn Phe Ser Gly Leu
1 5 10 15

Asp Gln Lys Phe Leu Ala Gly Gln Leu Leu Pro Pro Phe Ile Ser Ser
20 25 30

Phe Pro Ser Gly Ser Glu Glu Ala Ile Ser Arg His Phe His Pro Ser
35 40 45

Leu Ala Thr Ser Pro Pro Pro Leu Ile His Lys Gly Ala Arg Arg Arg
50 55 60

Leu Pro Gly His Val Gly Gly Gly Glu Gly Pro Thr Ala Ala Ala Arg
65 70 75 80

Pro Glu Thr Arg Arg Pro Glu Pro Ala Pro Arg Thr Arg Ala Pro Ala
85 90 95

Gly Arg Pro Gln Pro Ser Met Ser Ala Ala Thr His Ser Pro Met Met
100 105 110

Gln Val Ala Ser Gly Asn Gly Asp Arg Asp Pro Leu Pro Pro Gly Trp
115 120 125

Glu Ile Lys Ile Asp Pro Gln Thr Gly Trp Pro Phe Phe Val Asp His
130 135 140

Asn Ser Arg Thr Thr Thr Trp Asn Asp Pro Arg Val Pro Ser Glu Gly
145 150 155 160

Pro Lys Glu Thr Pro Ser Ser Ala Asn Gly Pro Ser Arg Glu Gly Ser
165 170 175

Arg Leu Pro Pro Ala Arg Glu Gly His Pro Val Tyr Pro Gln Leu Arg
180 185 190

Pro Gly Tyr Ile Pro Ile Pro Val Leu His Glu Gly Ala Glu Asn Arg
195 200 205

Gln Val His Pro Phe His Val Tyr Pro Gln Pro Gly Met Gln Arg Phe
210 215 220

Arg Thr Glu Ala Ala Ala Ala Ala Pro Gln Arg Ser Gln Ser Pro Leu
225 230 235 240

Arg Gly Met Pro Glu Thr Thr Gln Pro Asp Lys Gln Cys Gly Gln Val
245 250 255

Ala Ala Ala Ala Ala Ala Gln Pro Pro Ala Ser His Gly Pro Glu Arg
260 265 270

Ser Gln Ser Pro Ala Ala Ser Asp Cys Ser Ser Ser Ser Ser Ser Ala
275 280 285

Ser Leu Pro Ser Ser Gly Arg Ser Ser Leu Gly Ser His Gln Leu Pro
290 295 300

Arg Gly Tyr Ile Ser Ile Pro Val Ile His Glu Gln Asn Val Thr Arg
305 310 315 320

Pro Ala Ala Gln Pro Ser Phe His Lys Ala Gln Lys Thr His Tyr Pro
325 330 335

Ala Gln Arg Gly Glu Tyr Gln Thr His Gln Pro Val Tyr His Lys Ile
340 345 350

Gln Gly Asp Asp Trp Glu Pro Arg Pro Leu Arg Ala Ala Ser Pro Phe
355 360 365

Arg Ser Ser Val Gln Gly Ala Ser Ser Arg Glu Gly Ser Pro Ala Arg
370 375 380

Ser Ser Thr Pro Leu His Ser Pro Ser Pro Ile Arg Val His Thr Val
385 390 395 400

Val Asp Arg Pro Gln Gln Pro Met Thr His Arg Glu Thr Ala Pro Val
405 410 415

Ser Gln Pro Glu Asn Lys Pro Glu Ser Lys Pro Gly Pro Val Gly Pro
420 425 430

Glu Leu Pro Pro Gly His Ile Pro Ile Gln Val Ile Arg Lys Glu Val
435 440 445

Asp Ser Lys Pro Val Ser Gln Lys Pro Pro Pro Pro Ser Glu Lys Val
450 455 460

Glu Val Lys Val Pro Pro Ala Pro Val Pro Cys Pro Pro Pro Ser Pro
465 470 475 480

Gly Pro Ser Ala Val Pro Ser Ser Pro Lys Ser Val Ala Thr Glu Glu
485 490 495

Arg Ala Ala Pro Ser Thr Ala Pro Ala Glu Ala Thr Pro Pro Lys Pro
500 505 510

Gly Glu Ala Glu Ala Pro Pro Lys His Pro Gly Val Leu Lys Val Glu
515 520 525

Ala Ile Leu Glu Lys Val Gln Gly Leu Glu Gln Ala Val Asp Asn Phe
530 535 540

Glu Gly Lys Lys Thr Asp Lys Lys Tyr Leu Met Ile Glu Glu Tyr Leu
545 550 555 560

Thr Lys Glu Leu Leu Ala Leu Asp Ser Val Asp Pro Glu Gly Arg Ala
565 570 575

Asp Val Arg Gln Ala Arg Arg Asp Gly Val Arg Lys Val Gln Thr Ile
580 585 590

Leu Glu Lys Leu Glu Gln Lys Ala Ile Asp Val Pro Gly Gln Val Gln
595 600 605

Val Tyr Glu Leu Gln Pro Ser Asn Leu Glu Ala Asp Gln Pro Leu Gln
610 615 620

Ala Ile Met Glu Met Gly Ala Val Ala Ala Asp Lys Gly Lys Lys Asn
625 630 635 640

Ala Gly Asn Ala Glu Asp Pro His Thr Glu Thr Gln Gln Pro Glu Ala
645 650 655

Thr Ala Ala Ala Thr Ser Asn Pro Ser Ser Met Thr Asp Thr Pro Gly
660 665 670

Asn Pro Ala Ala Pro
675
<210> 7
<211> 1010
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (323) . . (1009)
<400> 7

acgatatcc: gtaagaccaa gaattgcaag gccagagttt gaattcttat acaaatggag 60 

cgtatggtcc aacatacccc ccaggccctg gggcaaatac tgcctcatac tcaggggctt 120 

attatgcacc tggttatact cagaccagtt actccacaga agttccaagt acttaccgtt 180

catctggcaa cagcccaact ccagtctctc gttggatcta tccccagcag gactgtcaag 240 

actgaagcac cccctcttaa ggggcaggtt ccaggatatc cgccttcaca gaaccctgga 300 

atgaccctgc cccattatcc tt atg gag atg gta atc gta gtg ttc cac aat 352
Met Glu Met Val Ile Val Val Phe His Asn
1 5 10

cac ggc cga ctg tac gac cac aag aaa gat gcg tgg gct tct cct ggt 400
His Gly Arg Leu Tyr Asp His Lys Lys Asp Ala Trp Ala Ser Pro Gly
15 20 25

gct tat gga atg ggt ggc cgt tat ccc tgg cct tca tca gcg ccc tca 448
Ala Tyr Gly Met Gly Gly Arg Tyr Pro Trp Pro Ser Ser Ala Pro Ser
30 35 40

gca cca ccc ggc aat ctc tac atg act gaa agt act tca cca tgg cct 496
Ala Pro Pro Gly Asn Leu Tyr Met Thr Glu Ser Thr Ser Pro Trp Pro
45 50 55

agc agt ggc tct ccc cag tca ccc cct tca ccc cca gtc cag cag ccc 544
Ser Ser Gly Ser Pro Gln Ser Pro Pro Ser Pro Pro Val Gln Gln Pro
60 65 70

aag gat tct tca tac ccc tat agc caa tca gat caa agc atg aac cgg 592
Lys Asp Ser Ser Tyr Pro Tyr Ser Gln Ser Asp Gln Ser Met Asn Arg
75 80 85 90

cac aac ttt cct tgc agt gtc cat cag tac gaa tcc tcg ggg aca gtg 640
His Asn Phe Pro Cys Ser Val His Gln Tyr Glu Ser Ser Gly Thr Val
95 100 105
aac aat gat gat tca gat ctt ttg gat tcc caa gtc cag tat agt gct 688
Asn Asn Asp Asp Ser Asp Leu Leu Asp Ser Gln Val Gln Tyr Ser Ala
110 115 120

gag cct cag ctg taz ggt aat gcc acc agt gac cat ccc aac aat caa 736
Glu Pro Gln Leu Tyr Gly Asn Ala Thr Ser Asp His Pro Asn Asn Gln
125 130 135

gat caa agt agc agt ctt cct gaa gaa tgt gta cct tca gat gaa agt 784
Asp Gln Ser Ser Ser Leu Pro Glu Glu Cys Val Pro Ser Asp Glu Ser
140 145 150

act cct ccg agt att aaa aaa atc ata cat gtg ctg gag aag gtc cag 832
Thr Pro Pro Ser Ile Lys Lys Ile Ile His Val Leu Glu Lys Val Gln
155 160 165 170

tat ctt gaa caa gaa gta gaa gaa ttt gta gga aaa aag aca gac aaa 880
Tyr Leu Glu Gln Glu Val Glu Glu Phe Val Gly Lys Lys Thr Asp Lys
175 180 185

gca tac tgg ctt ctg gaa gaa atg cta acc aag gaa ctt ttg gaa ctg 928
Ala Tyr Trp Leu Leu Glu Glu Met Leu Thr Lys Glu Leu Leu Glu Leu
190 195 200

gat tca gtt gaa act ggg ggc cag gac tct gta cgg cag gcc aga aaa 976
Asp Ser Val Glu Thr Gly Gly Gln Asp Ser Val Arg Gln Ala Arg Lys
205 210 215

gag gct gtt tgt aag att cag gcc ata ttg gaa a 1010
Glu Ala Val Cys Lys Ile Gln Ala Ile Leu Glu
220 225



<210> 8 
<211> 229 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Glu Met Val Ile Val Val Phe His Asn His Gly Arg Leu Tyr Asp
1 5 10 15

His Lys Lys Asp Ala Trp Ala Ser Pro Gly Ala Tyr Gly Met Gly Gly
20 25 30

Arg Tyr Pro Trp Pro Ser Ser Ala Pro Ser Ala Pro Pro Gly Asn Leu
35 40 45

Tyr Met Thr Glu Ser Thr Ser Pro Trp Pro Ser Ser Gly Ser Pro Gln
50 55 60

Ser Pro Pro Ser Pro Pro Val Gln Gln Pro Lys Asp Ser Ser Tyr Pro
65 70 75 80

Tyr Ser Gln Ser Asp Gln Ser Met Asn Arg His Asn Phe Pro Cys Ser
85 90 95

Val His Gln Tyr Glu Ser Ser Gly Thr Val Asn Asn Asp Asp Ser Asp
100 105 110

Leu Leu Asp Ser Gln Val Gln Tyr Ser Ala Glu Pro Gln Leu Tyr Gly
115 120 125

Asn Ala Thr Ser Asp His Pro Asn Asn Gln Asp Gln Ser Ser Ser Leu
130 135 140

Pro Glu Glu Cys Val Pro Ser Asp Glu Ser Thr Pro Pro Ser Ile Lys
145 150 155 160

Lys Ile Ile His Val Leu Glu Lys Val Gln Tyr Leu Glu Gln Glu Val
165 170 175

Glu Glu Phe Val Gly Lys Lys Thr Asp Lys Ala Tyr Trp Leu Leu Glu
180 185 190

Glu Met Leu Thr Lys Glu Leu Leu Glu Leu Asp Ser Val Glu Thr Gly
195 200 205

Gly Gln Asp Ser Val Arg Gln Ala Arg Lys Glu Ala Val Cys Lys Ile
210 215 220

Gln Ala Ile Leu Glu
225



<210> 9 

<211> 689 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> ( 3) . . ( 482) 

<220> 

<221> unsure 

<222> (105)

<223> any amino acid

<400> 9 

ga gaa ata aaa aat gaa ctt ctc caa gca caa aac cct tct gaa ttg 47
Glu Ile Lys Asn Glu Leu Leu Gln Ala Gln Asn Pro Ser Glu Leu
1 5 10 15

tac ctg agc tcc aaa aca gaa ttg cag ggt tta att gga cag ttg gat 95
Tyr Leu Ser Ser Lys Thr Glu Leu Gln Gly Leu Ile Gly Gln Leu Asp
20 25 30

gag gta agt att gaa aaa aac ccc tgc atc cgg gaa gcc agg aga aga 143
Glu Val Ser Xaa Glu Lys Asn Pro Cys Ile Arg Glu Ala Arg Arg Arg
35 40 45

gca gtg atc gag gtg caa act ctg atc aca tat att gac ttg aag gag 191
Ala Val Ile Glu Val Gln Thr Leu Ile Thr Tyr Ile Asp Leu Lys Glu
50 55 60

gcc ctt gag aaa aga aag ctg ttt gct tgt gag gag cac cca tcc cat 239
Ala Leu Glu Lys Arg Lys Leu Phe Ala Cys Glu Glu His Pro Ser His
65 70 75

aaa gcc gtc tgg aac gtc ctt gga aac ttg tct gag atc cag gga gaa 287
Lys Ala Val Trp Asn Val Leu Gly Asn Leu Ser Glu Ile Gln Gly Glu
80 85 90 95

gtt ctt tca ttt gat gga aat cga acc gat aag aac tac atc cgg ctg 335
Val Leu Ser Phe Asp Gly Asn Arg Thr Asp Lys Asn Tyr Ile Arg Leu
100 105 110

gaa gag ctg ctc acc aag cag ctg cta gcc ctg gat gct gtt gat ccg 383
Glu Glu Leu Leu Thr Lys Gln Leu Leu Ala Leu Asp Ala Val Asp Pro
115 120 125

cag gga gaa gag aag tgt aag gct gcc agg aaa caa gct gtg agg ctt 431
Gln Gly Glu Glu Lys Cys Lys Ala Ala Arg Lys Gln Ala Val Arg Leu
130 135 140

gcg cag aat att ctc agc tat ctc gac ctg aaa tct gat gaa tgg gag 479
Ala Gln Asn Ile Leu Ser Tyr Leu Asp Leu Lys Ser Asp Glu Trp Glu
145 150 155

tac tgaaatacca gagatctcac ttttgatact gttttgcact tcatatgtgc 532
Tyr
160
ttctacgtai agagagcttt cagttcattg atttatacgt gcatatttca gcctcagta: 592

ttatgartga agcaaattct attcagtatc tgctgctttt gatgttgcaa gacaaacatc 652

atta;agcac gttaactttr ccattcggat caaaaaa 689



<210> 10 

<211> 160 

<212> PRT 

<213> Homo sapiens 



<400> 10 

Glu Ile Lys Asn Glu Leu Leu Gln Ala Gln Asn Pro Ser Glu Leu Tyr
1 5 10 15

Leu Ser Ser Lys Thr Glu Leu Gln Gly Leu Ile Gly Gln Leu Asp Glu
20 25 30

Val Ser Xaa Glu Lys Asn Pro Cys Ile Arg Glu Ala Arg Arg Arg Ala
35 40 45

Val Ile Glu Val Gln Thr Leu Ile Thr Tyr Ile Asp Leu Lys Glu Ala
50 55 60

Leu Glu Lys Arg Lys Leu Phe Ala Cys Glu Glu His Pro Ser His Lys
65 70 75 80

Ala Val Trp Asn Val Leu Gly Asn Leu Ser Glu Ile Gln Gly Glu Val
85 90 95

Leu Ser Phe Asp Gly Asn Arg Thr Asp Lys Asn Tyr Ile Arg Leu Glu
100 105 110

Glu Leu Leu Thr Lys Gln Leu Leu Ala Leu Asp Ala Val Asp Pro Gln
115 120 125

Gly Glu Glu Lys Cys Lys Ala Ala Arg Lys Gln Ala Val Arg Leu Ala
130 135 140

Gln Asn Ile Leu Ser Tyr Leu Asp Leu Lys Ser Asp Glu Trp Glu Tyr
145 150 155 160



<210> 11 

<211> 246 

<212> DNA 

<213> Caenorhabditis elegans
<400> 11

atgt :tttcc gcctcttcgt tgaaatattt cactttctitt: tccagctttt tccccatctc 60

gacc^gcttt ggtttttcga gaaaaccacg ttccaaatca gcgacatctc tcaaattgag 120

atcataggct ttttgaagat tgctcaaatt atgcttctca cattgcatga gcattttgaa 180

gcccgcgtca tcaaccaaag cattttttcc acccatcaca acgattrtat cattttcttt 240
aaaait 246



<210> 12
<211> 210
<212> PRT
<213> Caenorhabditis elegans

<400> 12

Met Lys Val Asn Val Ser Cys Ser Ser Val Gln Thr Thr Ile Asp Ile
1 5 10 15

Leu Glu Glu Asn Gln Gly Glu Asp Glu Ser Ile Leu Thr Leu Gly Gln
20 25 30

Leu Arg Asp Arg Ile Ala Thr Asp Asn Asp Val Asp Val Glu Thr Met
35 40 45

Lys Leu Leu His Arg Gly Lys Phe Leu Gln Gly Ala Asp Asp Val Ser
50 55 60

Leu Ser Thr Leu Asn Phe Lys Glu Asn Asp Lys Ile Ile Val Met Gly
65 70 75 80

Gly Lys Asn Ala Leu Val Asp Asp Ala Gly Phe Lys Met Leu Met Gln
85 90 95

Tyr Glu Lys His Asn Leu Ser Asn Leu Gln Lys Ala Tyr Asp Leu Asn
100 105 110

Leu Arg Asp Val Ala Asp Leu Glu Arg Gly Phe Leu Glu Lys Pro Lys
115 120 125

Gln Val Glu Met Gly Lys Lys Leu Glu Lys Lys Val Lys Tyr Phe Asn
130 135 140

Glu Glu Ala Glu Arg His Leu Glu Thr Leu Asp Gly Met Asn Ile Ile
145 150 155 160

Thr Glu Thr Thr Pro Glu Asn Gln Ala Lys Arg Asn Arg Glu Lys Arg
165 170 175

Lys Thr Leu Val Asn Gly Ile Gln Thr Leu Leu Asn Gln Asn Asp Ala
180 185 190

Leu Leu Arg Arg Leu Gln Glu Tyr Gln Ser Val Leu Asn Gly Asp Ile
195 200 205

Pro Glu
210



<210> 13 

<211> 1377 

<212> DNA 

<213> Caenorhabditis elegans

<220>

<221> CDS 

<222> (1) . . (1377) 

<400> 13 

atg cca gtc gtg aac ata cca atc aaa ata ctt ggt cag aat caa tca 48
Met Pro Val Val Asn Ile Pro Ile Lys Ile Leu Gly Gln Asn Gln Ser
1 5 10 15

cat agt cga agt aac tcc tcg tct tct gtt gac aac gat cga aat caa 96
His Ser Arg Ser Asn Ser Ser Ser Ser Val Asp Asn Asp Arg Asn Gln
20 25 30

cca cca cag cag cca cct caa ccg caa cca caa cag caa tct cag caa 144
Pro Pro Gln Gln Pro Pro Gln Pro Gln Pro Gln Gln Gln Ser Gln Gln
35 40 45

caa tac cag cag gct cca aac gtg aat acc aat atg cat cat tcc aac 192
Gln Tyr Gln Gln Ala Pro Asn Val Asn Thr Asn Met His His Ser Asn
50 55 60

gga ttc tca cct aac ttc cca tct cgt agt cct att ccg gac ttt ccc 240
Gly Phe Ser Pro Asn Phe Pro Ser Arg Ser Pro Ile Pro Asp Phe Pro
65 70 75 80

agt ttt tca tct ggg ttc cca aac gat tct gaa tgg tct tcg aat ttc 288
Ser Phe Ser Ser Gly Phe Pro Asn Asp Ser Glu Trp Ser Ser Asn Phe
85 90 95
ccg tcg ttt cca aat ttc cca agt gga ttc tca aat gga agt tct aa: 336
Pro Ser Phe Pro Asn Phe Pro Ser Gly Phe Ser Asn Gly Ser Ser Asn
100 105 110

ttc cct gat ttt cca aga ttc gga aga gat gga gga cta tcg cca aac 384
Phe Pro Asp Phe Pro Arg Phe Gly Arg Asp Gly Gly Leu Ser Pro Asn
115 120 125

cca ccg atg caa gga tac agg aga agt cca aca cca aca tca act caa 432
Pro Pro Met Gln Gly Tyr Arg Arg Ser Pro Thr Pro Thr Ser Thr Gln
130 135 140

tct cca act tct aca tta aga cgc aac tct cag cag aat caa gct cct 480
Ser Pro Thr Ser Thr Leu Arg Arg Asn Ser Gln Gln Asn Gln Ala Pro
145 150 155 160

cca caa tat tct cag caa caa cca caa caa gct caa caa cgt cag aca 528
Pro Gln Tyr Ser Gln Gln Gln Pro Gln Gln Ala Gln Gln Arg Gln Thr
165 170 175

act cct ccg tca aca aaa gct tca tct cga cca cca tct cgt act cgt 576
Thr Pro Pro Ser Thr Lys Ala Ser Ser Arg Pro Pro Ser Arg Thr Arg
180 185 190

gaa cca aag gaa cct gag gta ccc gag aga cca gca gtt att cca ttg 624
Glu Pro Lys Glu Pro Glu Val Pro Glu Arg Pro Ala Val Ile Pro Leu
195 200 205

cca tat gag aag aag gag aaa cca ctg gag aag aaa ggt agt cgt gat 672
Pro Tyr Glu Lys Lys Glu Lys Pro Leu Glu Lys Lys Gly Ser Arg Asp
210 215 220

tct gga aag ggt gat gag aac ctt gaa gag aac att gcc aag atc acg 720
Ser Gly Lys Gly Asp Glu Asn Leu Glu Glu Asn Ile Ala Lys Ile Thr
225 230 235 240

atc gga aag aat aat tgc gag tta tgt ccg gaa caa gaa acg gac ggc 768
Ile Gly Lys Asn Asn Cys Glu Leu Cys Pro Glu Gln Glu Thr Asp Gly
245 250 255

gac cca tct cca cta acc tcc cca atc acc gaa gga aag cca aag aga 816
Asp Pro Ser Pro Leu Thr Ser Pro Ile Thr Glu Gly Lys Pro Lys Arg
260 265 270

gga aag aaa ctt caa cgt aat caa agt gtt gtt gat ttc aat gcc aag 864
Gly Lys Lys Leu Gln Arg Asn Gln Ser Val Val Asp Phe Asn Ala Lys
275 280 285
aca att gtt act ttg gat aaa att gaa tta caa gtt gag cag ttg aga 912
Thr Ile Val Thr Leu Asp Lys Ile Glu Leu Gln Val Glu Gln Leu Arg
290 295 300

aaa aaa gct gct gaa ctc gaa atg gaa aaa gag caa att ctt cgt tct 960
Lys Lys Ala Ala Glu Leu Glu Met Glu Lys Glu Gln Ile Leu Arg Ser
305 310 315 320

cta gga gaa atc agt gtt cat aac tgc atg ttc aaa ctg gaa gaa tgt 1008
Leu Gly Glu Ile Ser Val His Asn Cys Met Phe Lys Leu Glu Glu Cys
325 330 335

gat cgt gaa gag att gaa gca atc act gac cga ttg aca aaa aga aca 1056
Asp Arg Glu Glu Ile Glu Ala Ile Thr Asp Arg Leu Thr Lys Arg Thr
340 345 350

aag aca gtt caa gtt gtt gtc gaa act cca cga aat gaa gaa cag aaa 1104
Lys Thr Val Gln Val Val Val Glu Thr Pro Arg Asn Glu Glu Gln Lys
355 360 365

aaa gca ctg gaa gat gca act ttg atg atc gat gaa gtc gga gaa atg 1152
Lys Ala Leu Glu Asp Ala Thr Leu Met Ile Asp Glu Val Gly Glu Met
370 375 380

atg cat tcg aat att gaa aag gct aag ctg tgc cta caa acc tac atg 1200
Met His Ser Asn Ile Glu Lys Ala Lys Leu Cys Leu Gln Thr Tyr Met
385 390 395 400

aac gcc tgt tcg tac gaa gaa act gct gga gcc acc tgc caa aac ttc 1248
Asn Ala Cys Ser Tyr Glu Glu Thr Ala Gly Ala Thr Cys Gln Asn Phe
405 410 415

ttg aag atc ata att cag tgc gct gct gat gat cag aaa cgc atc aag 1296
Leu Lys Ile Ile Ile Gln Cys Ala Ala Asp Asp Gln Lys Arg Ile Lys
420 425 430

cgt cgt ctg gaa aat ctg atg tct caa att gag aat gct gag aga acg 1344
Arg Arg Leu Glu Asn Leu Met Ser Gln Ile Glu Asn Ala Glu Arg Thr
435 440 445

aaa gca gat ttg atg gat gat caa agc gaa tag 1377
Lys Ala Asp Leu Met Asp Asp Gln Ser Glu
450 455



<210> 14 
<211> 458 
<212> PRT 
<213> Caenorhabditis elegans



<400> 14 
Met Pro Val Val Asn Ile Pro Ile Lys Ile Leu Gly Gln Asn Gln Ser
1 5 10 15

His Ser Arg Ser Asn Ser Ser Ser Ser Val Asp Asn Asp Arg Asn Gln
20 25 30

Pro Pro Gln Gln Pro Pro Gln Pro Gln Pro Gln Gln Gln Ser Gln Gln
35 40 45

Gln Tyr Gln Gln Ala Pro Asn Val Asn Thr Asn Met His His Ser Asn
50 55 60

Gly Phe Ser Pro Asn Phe Pro Ser Arg Ser Pro Ile Pro Asp Phe Pro
65 70 75 80

Ser Phe Ser Ser Gly Phe Pro Asn Asp Ser Glu Trp Ser Ser Asn Phe
85 90 95

Pro Ser Phe Pro Asn Phe Pro Ser Gly Phe Ser Asn Gly Ser Ser Asn
100 105 110

Phe Pro Asp Phe Pro Arg Phe Gly Arg Asp Gly Gly Leu Ser Pro Asn
115 120 125

Pro Pro Met Gln Gly Tyr Arg Arg Ser Pro Thr Pro Thr Ser Thr Gln
130 135 140

Ser Pro Thr Ser Thr Leu Arg Arg Asn Ser Gln Gln Asn Gln Ala Pro
145 150 155 160

Pro Gln Tyr Ser Gln Gln Gln Pro Gln Gln Ala Gln Gln Arg Gln Thr
165 170 175

Thr Pro Pro Ser Thr Lys Ala Ser Ser Arg Pro Pro Ser Arg Thr Arg
180 185 190

Glu Pro Lys Glu Pro Glu Val Pro Glu Arg Pro Ala Val Ile Pro Leu
195 200 205

Pro Tyr Glu Lys Lys Glu Lys Pro Leu Glu Lys Lys Gly Ser Arg Asp
210 215 220

Ser Gly Lys Gly Asp Glu Asn Leu Glu Glu Asn Ile Ala Lys Ile Thr
225 230 235 240
Ile Gly Lys Asn Asn Cys Glu Leu Cys Pro Glu Gln Glu Thr Asp Gly
245 250 255

Asp Pro Ser Pro Leu Thr Ser Pro Ile Thr Glu Gly Lys Pro Lys Arg
260 265 270

Gly Lys Lys Leu Gln Arg Asn Gln Ser Val Val Asp Phe Asn Ala Lys
275 280 285

Thr Ile Val Thr Leu Asp Lys Ile Glu Leu Gln Val Glu Gln Leu Arg
290 295 300

Lys Lys Ala Ala Glu Leu Glu Met Glu Lys Glu Gln Ile Leu Arg Ser
305 310 315 320

Leu Gly Glu Ile Ser Val His Asn Cys Met Phe Lys Leu Glu Glu Cys
325 330 335

Asp Arg Glu Glu Ile Glu Ala Ile Thr Asp Arg Leu Thr Lys Arg Thr
340 345 350

Lys Thr Val Gln Val Val Val Glu Thr Pro Arg Asn Glu Glu Gln Lys
355 360 365

Lys Ala Leu Glu Asp Ala Thr Leu Met Ile Asp Glu Val Gly Glu Met
370 375 380

Met His Ser Asn Ile Glu Lys Ala Lys Leu Cys Leu Gln Thr Tyr Met
385 390 395 400

Asn Ala Cys Ser Tyr Glu Glu Thr Ala Gly Ala Thr Cys Gln Asn Phe
405 410 415

Leu Lys Ile Ile Ile Gln Cys Ala Ala Asp Asp Gln Lys Arg Ile Lys
420 425 430

Arg Arg Leu Glu Asn Leu Met Ser Gln Ile Glu Asn Ala Glu Arg Thr
435 440 445

Lys Ala Asp Leu Met Asp Asp Gln Ser Glu
450 455



<210> 15 
<211> 588 
<212> DNA

<213> Schizosaccharomyces pombe
<220>

<221> CDS 

<222> (1) . . (588) 

<400> 15 

atg tca gaa aag act agc aca gtt aca ata cac tat gga aat cag cga 48
Met Ser Glu Lys Thr Ser Thr Val Thr Ile His Tyr Gly Asn Gln Arg
1 5 10 15

ttt ccg gta gca gtc aat cta aat gag acg tta agt gaa ctg att gat 96
Phe Pro Val Ala Val Asn Leu Asn Glu Thr Leu Ser Glu Leu Ile Asp
20 25 30

gat tta ctt gaa acg act gag att tct gag aag aaa gtc aag ctt ttt 144
Asp Leu Leu Glu Thr Thr Glu Ile Ser Glu Lys Lys Val Lys Leu Phe
35 40 45

tac gct ggc aag cgt tta aaa gac aaa aaa gcc tcg tta tca aaa ttg 192
Tyr Ala Gly Lys Arg Leu Lys Asp Lys Lys Ala Ser Leu Ser Lys Leu
50 55 60

ggt tta aaa aat cat agt aaa att cta tgt ata aga cca cat aag caa 240
Gly Leu Lys Asn His Ser Lys Ile Leu Cys Ile Arg Pro His Lys Gln
65 70 75 80

caa cga ggt tcc aag gaa aaa gac acg gtt gag ccc gct ccg aaa gcg 288
Gln Arg Gly Ser Lys Glu Lys Asp Thr Val Glu Pro Ala Pro Lys Ala
85 90 95

gaa gcg gag aat cct gta ttt tcg cgt att tct gga gaa ata aaa gcc 336
Glu Ala Glu Asn Pro Val Phe Ser Arg Ile Ser Gly Glu Ile Lys Ala
100 105 110

atc gat cag tat gtt gac aaa gaa ctt tcc ccc atg tac gac aat tac 384
Ile Asp Gln Tyr Val Asp Lys Glu Leu Ser Pro Met Tyr Asp Asn Tyr
115 120 125

gta aat aaa ccg tcg aac gat cca aag cag aaa aac aaa cag aaa cta 432
Val Asn Lys Pro Ser Asn Asp Pro Lys Gln Lys Asn Lys Gln Lys Leu
130 135 140

atg ata agt gaa cta ctt tta caa cag ctt tta aaa ttg gat gga gtt 480
Met Ile Ser Glu Leu Leu Leu Gln Gln Leu Leu Lys Leu Asp Gly Val
145 150 155 160

gac gta ctg ggc agc gag aaa ttg cgt ttt gaa cgg aag caa ctt gtt 528
Asp Val Leu Gly Ser Glu Lys Leu Arg Phe Glu Arg Lys Gln Leu Val
165 170 175
tct aag atc caa aaa atg ttg gat cac gtt gac caa aca agc caa gaa 576
Ser Lys Ile Gln Lys Met Leu Asp His Val Asp Gln Thr Ser Gln Glu
180 185 190

gtg gcc gca tag 588
Val Ala Ala
195



<210> 16 
<211> 195 
<212> PRT

<213> Schizosaccharomyces pombe
<400> 16 

Met Ser Glu Lys Thr Ser Thr Val Thr Ile His Tyr Gly Asn Gln Arg
1 5 10 15

Phe Pro Val Ala Val Asn Leu Asn Glu Thr Leu Ser Glu Leu Ile Asp
20 25 30

Asp Leu Leu Glu Thr Thr Glu Ile Ser Glu Lys Lys Val Lys Leu Phe
35 40 45

Tyr Ala Gly Lys Arg Leu Lys Asp Lys Lys Ala Ser Leu Ser Lys Leu
50 55 60

Gly Leu Lys Asn His Ser Lys Ile Leu Cys Ile Arg Pro His Lys Gln
65 70 75 80

Gln Arg Gly Ser Lys Glu Lys Asp Thr Val Glu Pro Ala Pro Lys Ala
85 90 95

Glu Ala Glu Asn Pro Val Phe Ser Arg Ile Ser Gly Glu Ile Lys Ala
100 105 110

Ile Asp Gln Tyr Val Asp Lys Glu Leu Ser Pro Met Tyr Asp Asn Tyr
115 120 125

Val Asn Lys Pro Ser Asn Asp Pro Lys Gln Lys Asn Lys Gln Lys Leu
130 135 140

Met Ile Ser Glu Leu Leu Leu Gln Gln Leu Leu Lys Leu Asp Gly Val
145 150 155 160

Asp Val Leu Gly Ser Glu Lys Leu Arg Phe Glu Arg Lys Gln Leu Val
165 170 175

Ser Lys Ile Gln Lys Met Leu Asp His Val Asp Gln Thr Ser Gln Glu
180 185 190

Val Ala Ala
195



<210> 17 
<211> 621 
<212> DNA 

<213> Schizosaccharomyces pombe 

<220> 

<221> CDS 

<222> (1) . . (621) 

<400> 17 

atg tct ttt ttt acc cag ttg tgt tct atg gat aaa aaa tat tgg atc 48
Met Ser Phe Phe Thr Gln Leu Cys Ser Met Asp Lys Lys Tyr Trp Ile
1 5 10 15

tct cta gct gta ttg tca gtt act gtt ttg att agc gca tta ttg aaa 96
Ser Leu Ala Val Leu Ser Val Thr Val Leu Ile Ser Ala Leu Leu Lys
20 25 30

aag aga gct act gaa acc gaa gat att gtc gtt gtt cat tac gat ggc 144
Lys Arg Ala Thr Glu Thr Glu Asp Ile Val Val Val His Tyr Asp Gly
35 40 45

gaa aag ttg aat ttt gtg ttg cga caa cca agg ctg aat atg gtt tct 192
Glu Lys Leu Asn Phe Val Leu Arg Gln Pro Arg Leu Asn Met Val Ser
50 55 60

tac act agt ttt ctt cgt cgc gtg tgc aac gca ttt tca gta atg ccc 240
Tyr Thr Ser Phe Leu Arg Arg Val Cys Asn Ala Phe Ser Val Met Pro
65 70 75 80

gac aaa gcg tct ctc aag tta aac ggg gtg acc ctc aag gat ggt tca 288
Asp Lys Ala Ser Leu Lys Leu Asn Gly Val Thr Leu Lys Asp Gly Ser
85 90 95

ctt tcc gac caa aat gtg caa aat gga agt gaa tta gag ctc gaa tta 336
Leu Ser Asp Gln Asn Val Gln Asn Gly Ser Glu Leu Glu Leu Glu Leu
100 105 110

ccc aaa ctg agc ccg gca atg caa caa att gaa gca tat ata gat gag 384
Pro Lys Leu Ser Pro Ala Met Gln Gln Ile Glu Ala Tyr Ile Asp Glu
115 120 125

ctt caa cag gat ctc gtc cct aaa att gaa gcc ttc tgc caa tcg tct 432
Leu Gln Gln Asp Leu Val Pro Lys Ile Glu Ala Phe Cys Gln Ser Ser
130 135 140

ccc gct tcg gca caa gat gtt caa gat ttg cat aca cgc ctt agt gaa 480
Pro Ala Ser Ala Gln Asp Val Gln Asp Leu His Thr Arg Leu Ser Glu
145 150 155 160

aca ttg ttg gct agg atg ata aaa tta gat gct gtt aat gtt gaa gac 528
Thr Leu Leu Ala Arg Met Ile Lys Leu Asp Ala Val Asn Val Glu Asp
165 170 175

gac cca gaa gct cgt ctt aaa aga aaa gaa gct att cgt tta tct caa 576
Asp Pro Glu Ala Arg Leu Lys Arg Lys Glu Ala Ile Arg Leu Ser Gln
180 185 190

caa tat ttg agt aaa cta gat tcc acc aag aat caa aac aaa tga 621
Gln Tyr Leu Ser Lys Leu Asp Ser Thr Lys Asn Gln Asn Lys
195 200 205



<210> 18 
<211> 206 
<212> PRT 

<213> Schizosaccharomyces pombe
<400> 18

Met Ser Phe Phe Thr Gln Leu Cys Ser Met Asp Lys Lys Tyr Trp Ile
1 5 10 15

Ser Leu Ala Val Leu Ser Val Thr Val Leu Ile Ser Ala Leu Leu Lys
20 25 30

Lys Arg Ala Thr Glu Thr Glu Asp Ile Val Val Val His Tyr Asp Gly
35 40 45

Glu Lys Leu Asn Phe Val Leu Arg Gln Pro Arg Leu Asn Met Val Ser
50 55 60

Tyr Thr Ser Phe Leu Arg Arg Val Cys Asn Ala Phe Ser Val Met Pro
65 70 75 80

Asp Lys Ala Ser Leu Lys Leu Asn Gly Val Thr Leu Lys Asp Gly Ser
85 90 95

Leu Ser Asp Gln Asn Val Gln Asn Gly Ser Glu Leu Glu Leu Glu Leu
100 105 110

Pro Lys Leu Ser Pro Ala Met Gln Gln Ile Glu Ala Tyr Ile Asp Glu
115 120 125

Leu Gln Gln Asp Leu Val Pro Lys Ile Glu Ala Phe Cys Gln Ser Ser
130 135 140

Pro Ala Ser Ala Gln Asp Val Gln Asp Leu His Thr Arg Leu Ser Glu
145 150 155 160

Thr Leu Leu Ala Arg Met Ile Lys Leu Asp Ala Val Asn Val Glu Asp
165 170 175

Asp Pro Glu Ala Arg Leu Lys Arg Lys Glu Ala Ile Arg Leu Ser Gln
180 185 190

Gln Tyr Leu Ser Lys Leu Asp Ser Thr Lys Asn Gln Asn Lys
195 200 205



<210> 19
<211> 2534
<212> DNA

<213> Homo sapiens

<220>
<221> CDS

<222> (307)..(2034)

<400> 19

gcggagctcc gcatccaacc ccgggccgcg gccaacttct ctggactgga ccagaagttt 60 

ctagccggcc agttgctacc tccctttatc tcctccttcc cctctggcag cgaggaggct 120 

atttccagac acttccaccc ctctctggcc acgtcacccc cgcctttaat tcataaaggt 180 

gcccggcgcc ggcttcccgg acacgtcggc ggcggagagg ggcccacggc ggcggcccgg 240 

ccagagactc ggcgcccgga gccagcgccc cgcacccgcg ccccagcggg cagaccccaa 300 

cccagc atg agc gcc gcc acc cac tcg ccc atg atg cag gtg gcg tcc 348
Met Ser Ala Ala Thr His Ser Pro Met Met Gln Val Ala Ser
1 5 10

ggc aac ggt gac cgc gac cct ttg ccc ccc gga tgg gag atc aag atc 396
Gly Asn Gly Asp Arg Asp Pro Leu Pro Pro Gly Trp Glu Ile Lys Ile
15 20 25 30

gac ccg cag acc ggc tgg ccc ttc ttc gtg gac cac aac agc cgc acc 444
Asp Pro Gln Thr Gly Trp Pro Phe Phe Val Asp His Asn Ser Arg Thr
35 40 45

act acg tgg aac gac ccg cgc gtg ccc tct gag ggc ccc aag gag act 492
Thr Thr Trp Asn Asp Pro Arg Val Pro Ser Glu Gly Pro Lys Glu Thr
50 55 60

cca tcc tct gcc aat ggc cct tcc cgg gag ggc tct agg ctg ccg cct 540
Pro Ser Ser Ala Asn Gly Pro Ser Arg Glu Gly Ser Arg Leu Pro Pro
65 70 75

gct agg gaa ggc cac cct gtg tac ccc cag ctc cga cca ggc tac att 588
Ala Arg Glu Gly His Pro Val Tyr Pro Gln Leu Arg Pro Gly Tyr Ile
80 85 90

ccc att cct gtg ctc cat gaa ggc gct gag aac cgg cag gtg cac cct 636
Pro Ile Pro Val Leu His Glu Gly Ala Glu Asn Arg Gln Val His Pro
95 100 105 110

ttc cat gtc tat ccc cag cct ggg atg cag cga ttc cga act gag gcg 684
Phe His Val Tyr Pro Gln Pro Gly Met Gln Arg Phe Arg Thr Glu Ala
115 120 125

gca gca gcg gct cct cag agg tcc cag tca cct ctg cgg ggc atg cca 732
Ala Ala Ala Ala Pro Gln Arg Ser Gln Ser Pro Leu Arg Gly Met Pro
130 135 140

gaa acc act cag cca gat aaa cag tgt gga cag gtg gca gcg gcg gcg 780
Glu Thr Thr Gln Pro Asp Lys Gln Cys Gly Gln Val Ala Ala Ala Ala
145 150 155

gca gcc cag ccc cca gcc tcc cac gga cct gag cgg tcc cag tct cca 828
Ala Ala Gln Pro Pro Ala Ser His Gly Pro Glu Arg Ser Gln Ser Pro
160 165 170

gct gcc tct gac tgc tca tcc tca tcc tcc tcg gcc agc ctg cct tcc 876
Ala Ala Ser Asp Cys Ser Ser Ser Ser Ser Ser Ala Ser Leu Pro Ser
175 180 185 190

tcc ggc agg agc agc ctg ggc agt cac cag ctc ccg cgg ggg tac atc 924
Ser Gly Arg Ser Ser Leu Gly Ser His Gln Leu Pro Arg Gly Tyr Ile
195 200 205

tcc att ccg gtg ata cac gag cag aac gtt acc cgg cca gca gcc cag 972
Ser Ile Pro Val Ile His Glu Gln Asn Val Thr Arg Pro Ala Ala Gln
210 215 220

ccc tcc ttc cac aaa gcc cag aag acg cac tac cca gcg cag agg ggt 1020
Pro Ser Phe His Lys Ala Gln Lys Thr His Tyr Pro Ala Gln Arg Gly
225 230 235

gag tac cag acc cac cag cct gtg tac cac aag atc cag ggg gat gac 1068
Glu Tyr Gln Thr His Gln Pro Val Tyr His Lys Ile Gln Gly Asp Asp
240 245 250

tgg gag ccc cgg ccc ctg cgg gcg gca tcc ccg ttc agg tca tct gtc 1116
Trp Glu Pro Arg Pro Leu Arg Ala Ala Ser Pro Phe Arg Ser Ser Val
255 260 265 270

cag ggt gca tcg agc cgg gag ggc tca cca gcc agg agc agc acg cca 1164
Gln Gly Ala Ser Ser Arg Glu Gly Ser Pro Ala Arg Ser Ser Thr Pro
275 280 285

ctc cac tcc ccc tcg ccc atc cgt gtg cac acc gtg gtc gac agg cct 1212
Leu His Ser Pro Ser Pro Ile Arg Val His Thr Val Val Asp Arg Pro
290 295 300

cag cag ccc atg acc cat cga gaa act gca cct gtt tcc cag cct gaa 1260
Gln Gln Pro Met Thr His Arg Glu Thr Ala Pro Val Ser Gln Pro Glu
305 310 315

aac aaa cca gaa agt aag cca ggc cca gtt gga cca gaa ctc cct cct 1308
Asn Lys Pro Glu Ser Lys Pro Gly Pro Val Gly Pro Glu Leu Pro Pro
320 325 330

gga cac atc cca att caa gtg atc cgc aaa gag gtg gat tct aaa cct 1356
Gly His Ile Pro Ile Gln Val Ile Arg Lys Glu Val Asp Ser Lys Pro
335 340 345 350

gtt tcc cag aag ccc cca cct ccc tct gag aag gta gag gtg aaa gtt 1404
Val Ser Gln Lys Pro Pro Pro Pro Ser Glu Lys Val Glu Val Lys Val
355 360 365

ccc cct gct cca gtt cct tgt cct cct ccc agc cct ggc cct tct gct 1452
Pro Pro Ala Pro Val Pro Cys Pro Pro Pro Ser Pro Gly Pro Ser Ala
370 375 380

gtc ccc tct tcc ccc aag agt gtg gct aca gaa gag agg gca gcc ccc 1500
Val Pro Ser Ser Pro Lys Ser Val Ala Thr Glu Glu Arg Ala Ala Pro
385 390 395

agc act gcc cct gca gaa gct aca cct cca aaa cca gga gaa gcc gag 1548
Ser Thr Ala Pro Ala Glu Ala Thr Pro Pro Lys Pro Gly Glu Ala Glu
400 405 410

gct ccc cca aaa cat cca gga gtg ctg aaa gtg gaa gcc atc ctg gag 1596
Ala Pro Pro Lys His Pro Gly Val Leu Lys Val Glu Ala Ile Leu Glu
415 420 425 430



aag gtg cag ggg ctg gag cag gct gta gac aac ttt gaa ggc aag aag 1644
Lys Val Gln Gly Leu Glu Gln Ala Val Asp Asn Phe Glu Gly Lys Lys
435 440 445

act gac aaa aag tac ctg atg atc gaa gag tat ttg acc aaa gag ctg 1692
Thr Asp Lys Lys Tyr Leu Met Ile Glu Glu Tyr Leu Thr Lys Glu Leu
450 455 460

ctg gcc ctg gat tca gtg gac ccc gag gga cga gcc gat gtg cgt cag 1740
Leu Ala Leu Asp Ser Val Asp Pro Glu Gly Arg Ala Asp Val Arg Gln
465 470 475

gcc agg aga gac ggt gtc agg aag gtt cag acc atc ttg gaa aaa ctt 1788
Ala Arg Arg Asp Gly Val Arg Lys Val Gln Thr Ile Leu Glu Lys Leu
480 485 490

gaa cag aaa gcc att gat gtc cca ggt caa gtc cag gtc tat gaa ctc 1836
Glu Gln Lys Ala Ile Asp Val Pro Gly Gln Val Gln Val Tyr Glu Leu
495 500 505 510

cag ccc agc aac ctt gaa gca gat cag cca ctg cag gca atc atg gag 1884
Gln Pro Ser Asn Leu Glu Ala Asp Gln Pro Leu Gln Ala Ile Met Glu
515 520 525

atg ggt gcc gtg gca gca gac aag ggc aag aaa aat gct gga aat gca 1932
Met Gly Ala Val Ala Ala Asp Lys Gly Lys Lys Asn Ala Gly Asn Ala
530 535 540

gaa gat ccc cac aca gaa acc cag cag cca gaa gcc aca gca gca gcg 1980
Glu Asp Pro His Thr Glu Thr Gln Gln Pro Glu Ala Thr Ala Ala Ala
545 550 555

act tca aac ccc agc agc atg aca gac acc cct ggt aac cca gca gca 2028
Thr Ser Asn Pro Ser Ser Met Thr Asp Thr Pro Gly Asn Pro Ala Ala
560 565 570

ccg tag cctctgccct gtaaaaatca gactcggaac cgatgtgtgc tttagggaat 2084
Pro
575

tttaagttgc atgcatttca gagactttaa gtcagttggt ttttattagc tgcttggtat 2144

gcagtaactt gggtggaggc aaaacactaa taaaagggct aaaaaggaaa atgatgcttt 2204

tcttctata: tcttactctg tacaaataaa gaagttgct: gttgtttgag aagtttaacc 2264

ccgtcactcg ttctgcagcc ctgtctactt gggcaccccc accacctgtt agccgtggtt 2324 

gtgcactgtc ttttgtagct ctggactgga ggggtagatg gggagtcaat tacccatcac 2384 

araaanatga aacatttatc agaaatgttg ccattttaat gagatgattt tcttcatctc 2444 

ataaitaaaa tacctgactt tagagagagt aaaatgtgcc aggagccata ggaatatctg 2504 

tatgttggat gactttaatg ctacattttc 2534 



<210> 20
<211> 575
<212> PRT
<213> Homo sapiens
<400> 20



Met Ser Ala Ala Thr His Ser Pro Met Met Gln Val Ala Ser Gly Asn
1 5 10 15

Gly Asp Arg Asp Pro Leu Pro Pro Gly Trp Glu Ile Lys Ile Asp Pro
20 25 30

Gln Thr Gly Trp Pro Phe Phe Val Asp His Asn Ser Arg Thr Thr Thr
35 40 45

Trp Asn Asp Pro Arg Val Pro Ser Glu Gly Pro Lys Glu Thr Pro Ser
50 55 60

Ser Ala Asn Gly Pro Ser Arg Glu Gly Ser Arg Leu Pro Pro Ala Arg
65 70 75 80

Glu Gly His Pro Val Tyr Pro Gln Leu Arg Pro Gly Tyr Ile Pro Ile
85 90 95

Pro Val Leu His Glu Gly Ala Glu Asn Arg Gln Val His Pro Phe His
100 105 110

Val Tyr Pro Gln Pro Gly Met Gln Arg Phe Arg Thr Glu Ala Ala Ala
115 120 125

Ala Ala Pro Gln Arg Ser Gln Ser Pro Leu Arg Gly Met Pro Glu Thr
130 135 140

Thr Gln Pro Asp Lys Gln Cys Gly Gln Val Ala Ala Ala Ala Ala Ala
145 150 155 160

Gln Pro Pro Ala Ser His Gly Pro Glu Arg Ser Gln Ser Pro Ala Ala
165 170 175

Ser Asp Cys Ser Ser Ser Ser Ser Ser Ala Ser Leu Pro Ser Ser Gly
180 185 190

Arg Ser Ser Leu Gly Ser His Gln Leu Pro Arg Gly Tyr Ile Ser Ile
195 200 205

Pro Val Ile His Glu Gln Asn Val Thr Arg Pro Ala Ala Gln Pro Ser
210 215 220

Phe His Lys Ala Gln Lys Thr His Tyr Pro Ala Gln Arg Gly Glu Tyr
225 230 235 240

Gln Thr His Gln Pro Val Tyr His Lys Ile Gln Gly Asp Asp Trp Glu
245 250 255

Pro Arg Pro Leu Arg Ala Ala Ser Pro Phe Arg Ser Ser Val Gln Gly
260 265 270

Ala Ser Ser Arg Glu Gly Ser Pro Ala Arg Ser Ser Thr Pro Leu His
275 280 285

Ser Pro Ser Pro Ile Arg Val His Thr Val Val Asp Arg Pro Gln Gln
290 295 300

Pro Met Thr His Arg Glu Thr Ala Pro Val Ser Gln Pro Glu Asn Lys
305 310 315 320

Pro Glu Ser Lys Pro Gly Pro Val Gly Pro Glu Leu Pro Pro Gly His
325 330 335

Ile Pro Ile Gln Val Ile Arg Lys Glu Val Asp Ser Lys Pro Val Ser
340 345 350

Gln Lys Pro Pro Pro Pro Ser Glu Lys Val Glu Val Lys Val Pro Pro
355 360 365

Ala Pro Val Pro Cys Pro Pro Pro Ser Pro Gly Pro Ser Ala Val Pro
370 375 380

Ser Ser Pro Lys Ser Val Ala Thr Glu Glu Arg Ala Ala Pro Ser Thr
385 390 395 400

Ala Pro Ala Glu Ala Thr Pro Pro Lys Pro Gly Glu Ala Glu Ala Pro
405 410 415

Pro Lys His Pro Gly Val Leu Lys Val Glu Ala Ile Leu Glu Lys Val
420 425 430

Gln Gly Leu Glu Gln Ala Val Asp Asn Phe Glu Gly Lys Lys Thr Asp
435 440 445

Lys Lys Tyr Leu Met Ile Glu Glu Tyr Leu Thr Lys Glu Leu Leu Ala
450 455 460

Leu Asp Ser Val Asp Pro Glu Gly Arg Ala Asp Val Arg Gln Ala Arg
465 470 475 480

Arg Asp Gly Val Arg Lys Val Gln Thr Ile Leu Glu Lys Leu Glu Gln
485 490 495

Lys Ala Ile Asp Val Pro Gly Gln Val Gln Val Tyr Glu Leu Gln Pro
500 505 510

Ser Asn Leu Glu Ala Asp Gln Pro Leu Gln Ala Ile Met Glu Met Gly
515 520 525

Ala Val Ala Ala Asp Lys Gly Lys Lys Asn Ala Gly Asn Ala Glu Asp
530 535 540

Pro His Thr Glu Thr Gln Gln Pro Glu Ala Thr Ala Ala Ala Thr Ser
545 550 555 560

Asn Pro Ser Ser Met Thr Asp Thr Pro Gly Asn Pro Ala Ala Pro
565 570 575



<210> 21

<211> 1966

<212> DNA

<213> Homo sapiens

<220>
<221> CDS

<222> (43)..(1416)

<400> 21

cggtgggagc ggggcgggaa gcgcttcagg gcagcggatc cc atg tcg gcc ctg 54
Met Ser Ala Leu
1

agg cgc tcg ggc tac ggc ccc agt gac ggt ccg tcc tac ggc cgc tac 102
Arg Arg Ser Gly Tyr Gly Pro Ser Asp Gly Pro Ser Tyr Gly Arg Tyr
5 10 15 20

tac ggg ccr_ ggg ggt gga gat gtg ccg gta cac cca cct cca ccc tta 150
Tyr Gly Pro Gly Gly Gly Asp Val Pro Val His Pro Pro Pro Pro Leu
25 30 35

tat cct ctt cg: cct gaa cct ccc cag cct ccc att tcc tgg cgg gtg 198
Tyr Pro Leu Arg Pro Glu Pro Pro Gln Pro Pro Ile Ser Trp Arg Val
40 45 50

cgc ggg ggc ggc ccg gcg gag acc acc tgg ctg gga gaa ggc gga gga 246
Arg Gly Gly Gly Pro Ala Glu Thr Thr Trp Leu Gly Glu Gly Gly Gly
55 60 65

ggc gat ggc tac tat ccc tcg gga ggc gcc tgg cca gag cct ggt cga 294
Gly Asp Gly Tyr Tyr Pro Ser Gly Gly Ala Trp Pro Glu Pro Gly Arg
70 75 80

gcc gga gga agc cac cag gag cag cca cca tat cct agc tac aat tct 342
Ala Gly Gly Ser His Gln Glu Gln Pro Pro Tyr Pro Ser Tyr Asn Ser
85 90 95 100

aac tat tgg aat tct act gcg aga tct agg gct cct tac cca agt aca 390
Asn Tyr Trp Asn Ser Thr Ala Arg Ser Arg Ala Pro Tyr Pro Ser Thr
105 110 115

tat cct gta aga cca gaa ttg caa ggc cag agt ttg aat tct tat aca 438
Tyr Pro Val Arg Pro Glu Leu Gln Gly Gln Ser Leu Asn Ser Tyr Thr
120 125 130

aat gga gcg tat ggt cca aca tac ccc cca ggc cct ggg gca aat act 486
Asn Gly Ala Tyr Gly Pro Thr Tyr Pro Pro Gly Pro Gly Ala Asn Thr
135 140 145

gcc tca tac tca ggg gct tat tat gca cct ggt tat act cag acc agt 534
Ala Ser Tyr Ser Gly Ala Tyr Tyr Ala Pro Gly Tyr Thr Gln Thr Ser
150 155 160

tac tcc aca gaa gtt cca agt act tac cgt tca tct ggc aac agc cca 582
Tyr Ser Thr Glu Val Pro Ser Thr Tyr Arg Ser Ser Gly Asn Ser Pro
165 170 175 180

act cca gtc tct cgt tgg atc tat ccc cag cag gac tgt cag act gaa 630
Thr Pro Val Ser Arg Trp Ile Tyr Pro Gln Gln Asp Cys Gln Thr Glu
185 190 195

gca ccc cct ctt agg ggg cag gtt cca gga tat ccg cct tca cag aac 678
Ala Pro Pro Leu Arg Gly Gln Val Pro Gly Tyr Pro Pro Ser Gln Asn
200 205 210



cct gga atg acc ctg ccc cat tat cct tat gga gat ggt aat cgt agt 726
Pro Gly Met Thr Leu Pro His Tyr Pro Tyr Gly Asp Gly Asn Arg Ser
215 220 225

gtt cca caa tca gga ccg act gta cga cca caa gaa gat gcg tgg gct 774
Val Pro Gln Ser Gly Pro Thr Val Arg Pro Gln Glu Asp Ala Trp Ala
230 235 240

tct cct ggt gct tat gga atg ggt ggc cgt tat ccc tgg cct tca tca 822
Ser Pro Gly Ala Tyr Gly Met Gly Gly Arg Tyr Pro Trp Pro Ser Ser
245 250 255 260

gcg ccc tca gca cca ccc ggc aat ctc tac atg act gaa agt act tca 870
Ala Pro Ser Ala Pro Pro Gly Asn Leu Tyr Met Thr Glu Ser Thr Ser
265 270 275

cca tgg cct agc agt ggc tct ccc cag tca ccc cct tca ccc cca gtc 918
Pro Trp Pro Ser Ser Gly Ser Pro Gln Ser Pro Pro Ser Pro Pro Val
280 285 290

cag cag ccc aag gat tct tca tac ccc tat agc caa tca gat caa agc 966
Gln Gln Pro Lys Asp Ser Ser Tyr Pro Tyr Ser Gln Ser Asp Gln Ser
295 300 305

atg aac cgg cac aac ttt cct tgc agt gtc cat cag tac gaa tcc tcg 1014
Met Asn Arg His Asn Phe Pro Cys Ser Val His Gln Tyr Glu Ser Ser
310 315 320

ggg aca gtg atc aat gaa gat tca gat ctt ttg gat tcc caa gtc cag 1062
Gly Thr Val Ile Asn Glu Asp Ser Asp Leu Leu Asp Ser Gln Val Gln
325 330 335 340

tat agt gct gag cct cag ctg tat ggt aat gcc acc agt gac cat ccc 1110
Tyr Ser Ala Glu Pro Gln Leu Tyr Gly Asn Ala Thr Ser Asp His Pro
345 350 355

aac aat caa gat caa agt agc agt ctt cct gaa gaa tgt gta cct tca 1158
Asn Asn Gln Asp Gln Ser Ser Ser Leu Pro Glu Glu Cys Val Pro Ser
360 365 370

gat gaa agt act cct ccg agt att aaa aaa atc ata cat gtg ctg gag 1206
Asp Glu Ser Thr Pro Pro Ser Ile Lys Lys Ile Ile His Val Leu Glu
375 380 385

aag gtc cag tat ctt gaa caa gaa gta gaa gaa ttt gta gga aaa aag 1254
Lys Val Gln Tyr Leu Glu Gln Glu Val Glu Glu Phe Val Gly Lys Lys
390 395 400

aca gac aaa gca tac tgg ctt ctg gaa gaa atg cta acc aag gaa ctt 1302
Thr Asp Lys Ala Tyr Trp Leu Leu Glu Glu Met Leu Thr Lys Glu Leu
405 410 415 420

ttg gaa ctg gat tca gtt gaa act ggg ggc cag gac tct gta cgg cag 1350
Leu Glu Leu Asp Ser Val Glu Thr Gly Gly Gln Asp Ser Val Arg Gln
425 430 435

gcc aga aaa gag gct gtt tgt aag att cag gcc ata ctg gaa aaa tta 1398
Ala Arg Lys Glu Ala Val Cys Lys Ile Gln Ala Ile Leu Glu Lys Leu
440 445 450

gaa aaa aaa gga tta tga aaggatttag aacaaagtgg aagcctgtta 1446
Glu Lys Lys Gly Leu
455

ctaacttgac caaagaacac ttgattaggt taattaccct ctttttgaaa tgcctgttga 1506 

tgacaagaag caatacattc cagcttttcc tttgatttta tacttgaaaa actggcaaag 1566 

gaatggaaga atattttagt catgaagttg ttttcagttt tcagacgaat gaatgtaata 1626 

ggaaactatg gagttaccaa tattgccaag tagactcact ccttaaaaaa tttatggata 1686

tctacaagct gcttattacc agcaggaggg aaacacactt cacacaacag gcttatcaga 1746

aacctaccag atgaaactgg atataatttg agacaaacag gatgtgtttt tttaaacatc 1806

tggatatctt gtcacatttt tgtacattgt gactgctttc aacatatact tcatgtgtaa 1866

ttatagctta gactttagcc ttcttggact tctgttttgt tttgttattt gcagtttaca 1926

aatatagtat tattctctaa aaaaaaaaaa aaaaaaaaaa 1966 



<210> 22 
<211> 457 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Met Ser Ala Leu Arg Arg Ser Gly Tyr Gly Pro Ser Asp Gly Pro Ser
1 5 10 15

Tyr Gly Arg Tyr Tyr Gly Pro Gly Gly Gly Asp Val Pro Val His Pro
20 25 30

Pro Pro Pro Leu Tyr Pro Leu Arg Pro Glu Pro Pro Gln Pro Pro Ile
35 40 45

Ser Trp Arg Val Arg Gly Gly Gly Pro Ala Glu Thr Thr Trp Leu Gly
50 55 60

Glu Gly Gly Gly Gly Asp Gly Tyr Tyr Pro Ser Gly Gly Ala Trp Pro
65 70 75 80

Glu Pro Gly Arg Ala Gly Gly Ser His Gln Glu Gln Pro Pro Tyr Pro
85 90 95

Ser Tyr Asn Ser Asn Tyr Trp Asn Ser Thr Ala Arg Ser Arg Ala Pro
100 105 110

Tyr Pro Ser Thr Tyr Pro Val Arg Pro Glu Leu Gln Gly Gln Ser Leu
115 120 125

Asn Ser Tyr Thr Asn Gly Ala Tyr Gly Pro Thr Tyr Pro Pro Gly Pro
130 135 140

Gly Ala Asn Thr Ala Ser Tyr Ser Gly Ala Tyr Tyr Ala Pro Gly Tyr
145 150 155 160

Thr Gln Thr Ser Tyr Ser Thr Glu Val Pro Ser Thr Tyr Arg Ser Ser
165 170 175

Gly Asn Ser Pro Thr Pro Val Ser Arg Trp Ile Tyr Pro Gln Gln Asp
180 185 190

Cys Gln Thr Glu Ala Pro Pro Leu Arg Gly Gln Val Pro Gly Tyr Pro
195 200 205

Pro Ser Gln Asn Pro Gly Met Thr Leu Pro His Tyr Pro Tyr Gly Asp
210 215 220

Gly Asn Arg Ser Val Pro Gln Ser Gly Pro Thr Val Arg Pro Gln Glu
225 230 235 240

Asp Ala Trp Ala Ser Pro Gly Ala Tyr Gly Met Gly Gly Arg Tyr Pro
245 250 255

Trp Pro Ser Ser Ala Pro Ser Ala Pro Pro Gly Asn Leu Tyr Met Thr
260 265 270

Glu Ser Thr Ser Pro Trp Pro Ser Ser Gly Ser Pro Gln Ser Pro Pro
275 280 285

Ser Pro Pro Val Gln Gln Pro Lys Asp Ser Ser Tyr Pro Tyr Ser Gln
290 295 300

Ser Asp Gln Ser Met Asn Arg His Asn Phe Pro Cys Ser Val His Gln
305 310 315 320

Tyr Glu Ser Ser Gly Thr Val Ile Asn Glu Asp Ser Asp Leu Leu Asp
325 330 335

Ser Gln Val Gln Tyr Ser Ala Glu Pro Gln Leu Tyr Gly Asn Ala Thr
340 345 350

Ser Asp His Pro Asn Asn Gln Asp Gln Ser Ser Ser Leu Pro Glu Glu
355 360 365

Cys Val Pro Ser Asp Glu Ser Thr Pro Pro Ser Ile Lys Lys Ile Ile
370 375 380

His Val Leu Glu Lys Val Gln Tyr Leu Glu Gln Glu Val Glu Glu Phe
385 390 395 400

Val Gly Lys Lys Thr Asp Lys Ala Tyr Trp Leu Leu Glu Glu Met Leu
405 410 415

Thr Lys Glu Leu Leu Glu Leu Asp Ser Val Glu Thr Gly Gly Gln Asp
420 425 430

Ser Val Arg Gln Ala Arg Lys Glu Ala Val Cys Lys Ile Gln Ala Ile
435 440 445

Leu Glu Lys Leu Glu Lys Lys Gly Leu
450 455



<210> 23 

<211> 4308 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (247)..(1590)
<400> 23 

cccccccccc cccccccccc ccngaagacg cccggagcgg ctgctgcagc cagtagcggc 60 

cccttcaccg gctgccccgc tcagacciag tcgggagggg tgcgaggcat gcagctgggg 120

gcccagctcc ggtgccgcac cccgtaaagg gctgatcttc cacctcgcca cctcagccac 180

gggacgccaa gaccgcatcc aattcagac: tcttttggtg cttgtgaaac tgaacacaac 240

aaaagt atg gat atg gga aac caa cat cct tct att agt agg ctt cag 288
Met Asp Met Gly Asn Gln His Pro Ser Ile Ser Arg Leu Gln
1 5 10

gaa atc caa aag gaa gta aaa agt gta gaa cag caa gtt atc ggc ttc 336
Glu Ile Gln Lys Glu Val Lys Ser Val Glu Gln Gln Val Ile Gly Phe
15 20 25 30

agt ggt ctg tca gat gac aag aat tac aag aaa ctg gag agg att cta 384
Ser Gly Leu Ser Asp Asp Lys Asn Tyr Lys Lys Leu Glu Arg Ile Leu
35 40 45

aca aaa cag ctt ttt gaa ata gac tct gta gat act gaa gga aaa gga 432
Thr Lys Gln Leu Phe Glu Ile Asp Ser Val Asp Thr Glu Gly Lys Gly
50 55 60

gat att cag caa gct agg aag cgg gca gca cag gag aca gaa cgt ctt 480
Asp Ile Gln Gln Ala Arg Lys Arg Ala Ala Gln Glu Thr Glu Arg Leu
65 70 75

ctc aaa gag ttg gag cag aat gca aac cac cca cac cgg att gaa ata 528
Leu Lys Glu Leu Glu Gln Asn Ala Asn His Pro His Arg Ile Glu Ile
80 85 90

cag aac att ttt gag gaa gcc cag tcc ctc gtg aga gag aaa att gtg 576
Gln Asn Ile Phe Glu Glu Ala Gln Ser Leu Val Arg Glu Lys Ile Val
95 100 105 110

cca ttt tat aat gga ggc aac tgc gta act gat gag ttt gaa gaa ggc 624
Pro Phe Tyr Asn Gly Gly Asn Cys Val Thr Asp Glu Phe Glu Glu Gly
115 120 125

atc caa gat atc att ctg agg ctg aca cat gtt aaa act gga gga aaa 672
Ile Gln Asp Ile Ile Leu Arg Leu Thr His Val Lys Thr Gly Gly Lys
130 135 140

atc tcc ttg cgg aaa gca agg tat cac act tta acc aaa atc tgt gcg 720
Ile Ser Leu Arg Lys Ala Arg Tyr His Thr Leu Thr Lys Ile Cys Ala
145 150 155

gtg caa gag ata atc gaa gac tgc atg aaa aag cag cct tcc ctg ccg 768
Val Gln Glu Ile Ile Glu Asp Cys Met Lys Lys Gln Pro Ser Leu Pro
160 165 170

ctt tcc gag gat gca cat cct tcc gtt gcc aaa atc aac ttc gtg atg 816
Leu Ser Glu Asp Ala His Pro Ser Val Ala Lys Ile Asn Phe Val Met
175 180 185 190

tgt gag gtg aac aag gcc cga ggg gtc ctg att gca ctt ctg atg ggt 864
Cys Glu Val Asn Lys Ala Arg Gly Val Leu Ile Ala Leu Leu Met Gly
195 200 205

gtg aac aac aat gag acc tgc agg cac tta tcc tgt gtg ctc tcg ggg 912
Val Asn Asn Asn Glu Thr Cys Arg His Leu Ser Cys Val Leu Ser Gly
210 215 220

ctg atc gct gac ctg gat gct cta gat gtg tgc ggc cgg aca gaa atc 960
Leu Ile Ala Asp Leu Asp Ala Leu Asp Val Cys Gly Arg Thr Glu Ile
225 230 235

aga aat tat cgg agg gag gta gta gaa gat atc aac aaa tta ttg aaa 1008
Arg Asn Tyr Arg Arg Glu Val Val Glu Asp Ile Asn Lys Leu Leu Lys
240 245 250

tat ctg gat ttg gaa gag gaa gca gac aca act aaa gca ttt gac ctg 1056
Tyr Leu Asp Leu Glu Glu Glu Ala Asp Thr Thr Lys Ala Phe Asp Leu
255 260 265 270

aga cag aat cat tcc att tta aaa ata gaa aag gtc ctc aag aga atg 1104
Arg Gln Asn His Ser Ile Leu Lys Ile Glu Lys Val Leu Lys Arg Met
275 280 285

aga gaa ata aaa aat gaa ctt ctc caa gca caa aac cct tct gaa ttg 1152
Arg Glu Ile Lys Asn Glu Leu Leu Gln Ala Gln Asn Pro Ser Glu Leu
290 295 300

tac ctg agc tcc aaa aca gaa ttg cag ggt tta att gga cag ttg gat 1200
Tyr Leu Ser Ser Lys Thr Glu Leu Gln Gly Leu Ile Gly Gln Leu Asp
305 310 315

gag gta agt ctt gaa aaa aac ccc tgc atc cgg gaa gcc agg aga aga 1248
Glu Val Ser Leu Glu Lys Asn Pro Cys Ile Arg Glu Ala Arg Arg Arg
320 325 330

gca gtg atc gag gtg caa act ctg atc aca tat att gac ttg aag gag 1296
Ala Val Ile Glu Val Gln Thr Leu Ile Thr Tyr Ile Asp Leu Lys Glu
335 340 345 350

gcc ctt gag aaa aga aag ctg ttt gct tgt gag gag cac cca tcc cat 1344
Ala Leu Glu Lys Arg Lys Leu Phe Ala Cys Glu Glu His Pro Ser His
355 360 365

aaa gcc gtc tgg aac gtc ctt gga aac ttg tct gag atc cag gga gaa 1392
Lys Ala Val Trp Asn Val Leu Gly Asn Leu Ser Glu Ile Gln Gly Glu
370 375 380

gtt ctt tca ttt gat gga aat cga acc gat aag aac tac atc cgg ctg 1440
Val Leu Ser Phe Asp Gly Asn Arg Thr Asp Lys Asn Tyr Ile Arg Leu
385 390 395

gaa gag ctg ctc acc aag cag ctg cta gcc ctg gat gct gtt gat ccg 1488
Glu Glu Leu Leu Thr Lys Gln Leu Leu Ala Leu Asp Ala Val Asp Pro
400 405 410

cag gga gaa gag aag tgt aag gct gcc agg aaa caa gct gtg agg ctt 1536
Gln Gly Glu Glu Lys Cys Lys Ala Ala Arg Lys Gln Ala Val Arg Leu
415 420 425 430

gcg cag aat att ctc agc tat ctc gac ctg aaa tct gat gaa tgg gag 1584
Ala Gln Asn Ile Leu Ser Tyr Leu Asp Leu Lys Ser Asp Glu Trp Glu
435 440 445

tac tga aataccagag atctcacttt tgatactgtt ttgcacttca tatgtgcttc 1640
Tyr
tatgtataga gagctttcag ttcattgatt tatacgtgca tatttcagtc tcagtattta 1700

tgattgaagc aaattctatt cagtatctgc tgcttttgat gttgcaagac aaatatcatt 1760

acagcacgtt aacttttcca ttcggatcat tatctgtatg atgtggtgtg gtttgtttgg 1820

tttgtccttt tttttgcgtt ttraatcaga aaacaaaata gaggcagctt ttgtagattt 1880

taaatgggtt gtgcaagcat taaaatgcag gtctttcaga atctagaact aggcataacc 1940

ttacataata ctaggaaaat tatgagaaag gggaaatttt tggttaaata agagtaaggt 2000 

tcaaacacaa gcagtacatg ttctgtttca ttatgctcga tagaaggctt ttttttcact 2060

tataaggcct gattggtcct acccagctta acggggtggg gtttttttgt ttgttcagac 2120

agtctgttct tttgtaaaca tttttagttg gaaaaacagc atctgcattt tccccatcct 2180

ctacgtttta gagaggaatc ttgtttttgt gtgcaacata agaaaattat gaaaactaat 2240

agccaaaaaa cctttgagat tgcattaaag agaagggata aaggaccagc aataatacct 2300

tgtaagttgc ttttgtttgt aaaatctgag cttatagttt tccttagtga gtaaattcat 2360

aaggatggga acatttaaat taagttaatg ggcctttaaa aaaaaaaaag gaaacactca 2420

tacctgtagt tggaggatga atactggaga cgggttacca atgtcaggtt atactaaaac 2480 

taaatcagaa agtctgaatg tagcacataa tggttcrcrt ctgttgtcca aggctgtaaa 2540 

atggacagcc ttgtcacacc tccccggtgc tgttttacaa cgtgagggta gacgctgtca 2600 

gtaacccaga gggaccaggc cttcctaggt tttctaggca gtcagctgtt aaccactcac 2660 

ttagtaaatg tcataactac acctgctcca ggaccaatca gtgaaacctg ctcggaatta 2720 

aaggcttcct ctgggtgcct gctgaacaac tgagctcatg tcatgggcat gtggtggttt 2780 

ctctgttgcc tgaaagagcc attaaagtca gtcgtgcgtg aagcatctct cttctaaagg 2840 

atgtgtattt ccataaatgc tttctgagga tccggtacaa aatgatttcc caaagttctg 2900 

aagtgccttg agaacatgtg ggtccgagtg ttataacaga ctcctccccc gggtcacctt 2960 

ttgcctggtc atcctgttag agtacatctt tggaaatcca gggtaatatt ctctttcaga 3020 

gatgctcatt gtgtaactct gtgtagggag atagtcactt taaacagctc aaagtagcta 3080 

gctaaaggag tagccttaaa tacctaaaag atgacagaag catagccctt aacaaatctt 3140 

cagcttgtct ctcagtattt cccaatcatg aaaatccctt gctatgtctt tcctactaga 3200 

aatgttctag aatcgctgga cggtggggtc agagggcagt cggtatttag gccgtgagct 3260 

tcccatacta ctgcaggtcc aactcctggc aaccgcgggc tcaaggcagg tcattggaat 3320 

ccacgttttg gccacagtag ttgtaggatt gcttttctgt atcataattt tagaatgctc 3380 

ttaaaatctt gaggaagagt ttttattttt tatttatttt tgagatggag tctctgttgc 3440 

ccaggctgca gtgcagtggt gccatctcag ctcactgcaa cctccacctc ccaggttcaa 3500 

gcgattctcc tgcctcagcc acctgagtag ctgggagtac aggcatgtgg caccatgcct 3560 

ggctaatttt tgtattttta atagagttga gatttcacca tgatggtcag gctggtctcg 3620 

aactcctgac ctcgtgatcc gcccgcctcg gccccccaaa gtgctgggat taacgggtgt 3680 

gagccacggc gcccagccca ggaagagttt ttaaattaga gctctgttta attataccac 3740 

tgggaaatca tggttacgct tcaggcatat tcttccccag agtactactt acattttaaa 3800 

tttcattttg taaagttaaa tgtcagcatt ccctttaaaa gtgtccattg tcctttgaaa 3860

gtagacgttt cagtcattct tt:caaacaa gtgtctgr.gr accttttgcc aagctgtggg 3920

catcgtgtgt gagtacaggg tgctcagctc ttccaccgtc attttgaatt gttcacatgg 3980 

gtaattggtc atggaaatga tcagattgac cttgattgac tgtcaggcat ggctttgttt 4040 

ctagtttcaa tctgttctcg ttccttgtac cggattattc tactcctgca atgaaccctg 4100 

ttgacaccgg atttagctct tgtcggcctt cgtggggagc tgtttgtgtt aatatgagct 4160

actgcatgta attcttaaac tgggcttgtc acattgtatt gtatttttgt gatctgtaat 4220 

gaaaagaatc tgtactgcaa gtaaaaccta ctccccaaaa atgtgtggct ttgggtctgc 4280 

attaaacgct gtagtccatg ttcatgcc 4308 



<210> 24 

<211> 447 

<212> PRT 

<213> Homo sapiens 

<400> 24 

Met Asp Met Gly Asn Gln His Pro Ser Ile Ser Arg Leu Gln Glu Ile
1 5 10 15

Gln Lys Glu Val Lys Ser Val Glu Gln Gln Val Ile Gly Phe Ser Gly
20 25 30

Leu Ser Asp Asp Lys Asn Tyr Lys Lys Leu Glu Arg Ile Leu Thr Lys
35 40 45

Gln Leu Phe Glu Ile Asp Ser Val Asp Thr Glu Gly Lys Gly Asp Ile
50 55 60

Gln Gln Ala Arg Lys Arg Ala Ala Gln Glu Thr Glu Arg Leu Leu Lys
65 70 75 80

Glu Leu Glu Gln Asn Ala Asn His Pro His Arg Ile Glu Ile Gln Asn
85 90 95

Ile Phe Glu Glu Ala Gln Ser Leu Val Arg Glu Lys Ile Val Pro Phe
100 105 110

Tyr Asn Gly Gly Asn Cys Val Thr Asp Glu Phe Glu Glu Gly Ile Gln
115 120 125

Asp Ile Ile Leu Arg Leu Thr His Val Lys Thr Gly Gly Lys Ile Ser
130 135 140

Leu Arg Lys Ala Arg Tyr His Thr Leu Thr Lys Ile Cys Ala Val Gln
145 150 155 160

Glu Ile Ile Glu Asp Cys Met Lys Lys Gln Pro Ser Leu Pro Leu Ser
165 170 175

Glu Asp Ala His Pro Ser Val Ala Lys Ile Asn Phe Val Met Cys Glu
180 185 190

Val Asn Lys Ala Arg Gly Val Leu Ile Ala Leu Leu Met Gly Val Asn
195 200 205

Asn Asn Glu Thr Cys Arg His Leu Ser Cys Val Leu Ser Gly Leu Ile
210 215 220

Ala Asp Leu Asp Ala Leu Asp Val Cys Gly Arg Thr Glu Ile Arg Asn
225 230 235 240

Tyr Arg Arg Glu Val Val Glu Asp Ile Asn Lys Leu Leu Lys Tyr Leu
245 250 255

Asp Leu Glu Glu Glu Ala Asp Thr Thr Lys Ala Phe Asp Leu Arg Gln
260 265 270

Asn His Ser Ile Leu Lys Ile Glu Lys Val Leu Lys Arg Met Arg Glu
275 280 285

Ile Lys Asn Glu Leu Leu Gln Ala Gln Asn Pro Ser Glu Leu Tyr Leu
290 295 300

Ser Ser Lys Thr Glu Leu Gln Gly Leu Ile Gly Gln Leu Asp Glu Val
305 310 315 320

Ser Leu Glu Lys Asn Pro Cys Ile Arg Glu Ala Arg Arg Arg Ala Val
325 330 335

Ile Glu Val Gln Thr Leu Ile Thr Tyr Ile Asp Leu Lys Glu Ala Leu
340 345 350

Glu Lys Arg Lys Leu Phe Ala Cys Glu Glu His Pro Ser His Lys Ala
355 360 365

Val Trp Asn Val Leu Gly Asn Leu Ser Glu Ile Gln Gly Glu Val Leu